Abstract

We consider the overdamped limit of two-dimensional double well systems per- 
turbed by weak noise. In the weak noise limit the most probable fluctuational path 
leading from either point attractor to the separatrix (the most probable escape path, 
or MPEP) must terminate on the saddle between the two wells. However, as the param- 
eters of a symmetric double well system are varied, a unique MPEP may bifurcate into 
two equally likely MPEP's. At the bifurcation point in parameter space, the activation 
kinetics of the system become non-Arrhenius. We quantify the non-Arrhenius behavior 
of a system at the bifurcation point, by using the Maslov-WKB method to construct 
an approximation to the quasistationary probability distribution of the system that is 
valid in a boundary layer near the separatrix. The approximation is a formal asymp- 
totic solution of the Smoluchowski equation. Our analysis relies on the construction of 
a new scaling theory, which yields 'critical exponents' describing weak-noise behavior 
at the bifurcation point, near the saddle. 
1 Introduction



In this paper we build on previous work [28, 29, 30] by analysing an unusual bifurcation
phenomenon in the theory of noise-activated transitions. We study its appearance in the
overdamped limit of two-dimensional double well systems with nongradient dynamics. In this
context, the new phenomenon is a bifurcation of the most probable transition path (in the
limit of weak noise) between the two wells, as a system parameter is varied.

In many ways, the behavior of a system whose most probable transition path is just 
beginning to bifurcate resembles that of a system undergoing a phase transition. In par- 
ticular, double well systems which are 'at criticality' in the bifurcation sense will exhibit 
non-Arrhenius behavior. This means that the growth of the mean time between inter-well 
fluctuations, i.e., the growth of the mean time needed for the system to hop from one well 
to the other, will not be pure exponential in the weak-noise limit. In double well systems 
at criticality, relaxation due to activation will proceed (in the limit of weak noise) at an 
anomalous, in fact anomalously large, rate. 

To treat the previously unnoticed phenomenon of bifurcation, we need to develop a new 
approach for treating transitions induced by weak noise, when a 'soft mode' appears in the 
dynamics of transverse fluctuations around the most probable transition path. Since this is 
analogous to a phase transition, we introduce a scaling theory. In the context of double well 
systems, our scaling theory is a theory of behavior near the saddle point between the two 
wells, since the saddle is where the most probable inter-well transition path begins to bifur- 
cate. We shall demonstrate that the theory explains the weak-noise behavior, at criticality, 
of a large universality class of double well systems. 

The scaling theory will reveal a striking feature of the bifurcation phenomenon, which is 
that in any 'critical' double well system there appears (in the weak-noise limit) a nongeneric 
singularity in the stationary probability distribution, located at the saddle point. As Berry [§J 
discusses, a singularity is nongeneric if it arises, in an appropriate WKB sense, from a 
catastrophe of unusual type; i.e., one of infinite codimension. The stationary distribution 
near the saddle point is described, in the limit of weak noise, by an unusual (non-canonical) 
diffraction function. The familiar special functions of WKB theory (Airy functions, Pearcey 
functions, etc.) do not suffice. The singularity at the saddle, and the diffraction function 
with which it is 'clothed,' can be viewed as the mathematical source of the non-Arrhenius 
weak-noise asymptotics. 

We begin with three largely qualitative sections. In Section 2 we review the physical
relevance of overdamped models with non-gradient dynamics, and in Section 3 explain how
the weak-noise behavior of any double well model of this type is determined by its flow field
of instanton trajectories (most probable fluctuational paths). In Section 4 we sketch the
gross features of the bifurcation phenomenon, including further bifurcations and
universality. In Section 5, our treatment becomes more quantitative. We first review
the matched asymptotic approximations technique we have employed elsewhere [28, 30],
and begin extending it to handle models with singularities. In Section 5.3 we explain why
the bifurcation transition deserves to be called a phase transition. In particular, we explain
how behavior near criticality is described by critical exponents, which characterize the rate
of divergence of measurable quantities (e.g., the pre-exponential factor in the weak-noise
asymptotics of the mean inter-well fluctuation time). In Section 5.4 we explain how to
determine whether any given double well model is 'critical.' The transverse Jacobi operator
is the differential operator appearing in the second variation of the Onsager-Machlup action
functional, when one varies about the most probable inter-well transition path. The onset
of bifurcation occurs when this operator acquires a zero eigenvalue.

In Section 6 we explain the use we shall make of Maslov's geometric theory of wave
asymptotics [§, |32j. In Section 7 we introduce the concept of a scaling theory, by
developing a scaling theory of weak-noise behavior near generic (cusp) singularities. We show
how the scaling theory justifies the Ginzburg-Landau approximation used in this context by
Dykman et al. [12]. In Section 8 we develop an analogous scaling theory for the nongeneric
singularity associated with the onset of bifurcation. We compare our theory with numerical
data, and examine its predictions for non-Arrhenius behavior and the stationary distribution
near the saddle point.

In Section 9 we discuss our results. The reader may wish to glance ahead at Fig. [TJ], which
is an Arrhenius plot of the inter-well hopping rate of any double well system at criticality. 
The non-Arrhenius behavior shown there, in particular the 'logarithmic bend,' is the key 
result of this paper. 



2 Preliminaries 

Statistical physics and chemical physics include many examples of stochastically perturbed 
dynamical systems. It is often the case that the state of such a system is modelled as a 
particle moving in an n-dimensional force field F(x), and subject to additional random 
perturbations ('noise'). Since our interest is in the modelling of nonequilibrium systems, 
we shall not assume (as is usually done) that this force field is conservative. 

If the motion of the particle is isotropically damped, with damping constant $\gamma$, in the
absence of noise the particle position $x$ would obey the deterministic equation

$$m\ddot{x} + \gamma m\dot{x} = F(x). \tag{1}$$

Adding a random force $F_{\rm random}(t)$ yields the Langevin equation

$$m\ddot{x} + \gamma m\dot{x} = F(x) + F_{\rm random}(t). \tag{2}$$

In physical problems $F_{\rm random}(t)$ is often modelled as Gaussian white noise with amplitude
$\sqrt{2\gamma m k_B T}$, where $T$ is the ambient temperature. In this case the associated partial
differential equation, which describes the time evolution of the probability density of $x$ and its
velocity, is known as the (forward) Fokker-Planck equation.

A case particularly important in applications is the overdamped, or inertialess case, when
$\gamma \gg t_0^{-1}$, for $t_0$ the physical time scale. In this case the $m\ddot{x}$ term in (2) can be
dropped, and the Langevin equation becomes first order in time. If time is rescaled by a
factor $\gamma m$ (i.e., $t \leftarrow \gamma m t$), it may be written in the normalized form

$$\dot{x} = u(x) + \epsilon^{1/2}\,\dot{w}(t). \tag{3}$$

Here $\dot{w}(t)$ is a standard $n$-dimensional Gaussian white noise (the derivative of $w(t)$, a
standard $n$-dimensional Wiener process), the 'drift field' $u$ equals $F$, and $\epsilon$ equals $2k_BT$. The
corresponding scalar advection-diffusion equation for the probability density $p = p(x,t)$ of $x$,

$$\dot{p} = (\epsilon/2)\nabla^2 p - \nabla\cdot(p\,u), \tag{4}$$

is known as the (forward) Smoluchowski equation. It may be written as $\dot{p} = -\mathcal{L}^* p$, where

$$\mathcal{L}^* = -(\epsilon/2)\nabla^2 + u\cdot\nabla + (\nabla\cdot u). \tag{5}$$

It is often necessary to generalize the equation to include the effects of anisotropic
damping or state-dependent noise [23]. However, in this paper we consider only overdamped
systems whose Langevin equation is of the form (3). Since we do not require the
deterministic forces to be conservative, we do not require $u$ to be a gradient field. This means
that even in stationarity, the system may not display detailed balance. Equivalently, the
stationary probability distribution for the system may not be (in the traditional sense) in
thermal equilibrium.

Attractors of the drift field $u$, in particular point attractors, correspond to 'metastable
states': they are stable states of the underlying deterministic dynamics, but the thermal
noise may induce transitions between them. Of great physical interest is the time needed
for this to occur. For example, how long does it take for the noise in (3) to overcome the
drift toward a specified stable point $S$, and drive the system state $x$ beyond the domain of
attraction of $S$, toward another attractor? The study of such noise-activated transitions is
known as the stochastic exit problem, or the escape problem. For general stochastic models
only numerical results can be obtained (see, e.g., Ref. [7]). The Smoluchowski equation is
particularly difficult to handle in the $\epsilon \to 0$ limit. This is the weak-noise, or low-temperature
limit, in which the mean first passage time (MFPT) $\langle\tau\rangle$ from $S$ to the boundary of its domain
of attraction grows exponentially. In this limit a single escape path (the most probable escape
path, or MPEP) usually dominates. Our approach to the weak-noise limit, which does not
rely on a numerical simulation of the Smoluchowski equation, will exploit this asymptotic
determinism quite heavily.
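
For readers who wish to experiment with the Langevin equation (3) directly, the following is a minimal sketch of an Euler-Maruyama integrator. It is an illustration only, not the method used in this paper, which avoids direct simulation. The function names, the time step `dt`, and the linear test drift are all assumptions introduced here.

```python
import numpy as np


def euler_maruyama(u, x0, eps, dt, n_steps, rng=None):
    """Integrate dx = u(x) dt + sqrt(eps) dw (Eq. (3)) by the Euler-Maruyama scheme.

    u       : callable mapping a state vector to the drift vector
    x0      : initial state
    eps     : noise strength epsilon
    dt      : time step (a numerical parameter, not part of the model)
    n_steps : number of steps
    Returns an array of shape (n_steps + 1, len(x0)) holding one sample path.
    """
    rng = np.random.default_rng() if rng is None else rng
    x0 = np.asarray(x0, dtype=float)
    path = np.empty((n_steps + 1, x0.size))
    path[0] = x0
    for k in range(n_steps):
        dw = rng.normal(0.0, np.sqrt(dt), size=x0.size)  # Wiener increment over dt
        path[k + 1] = path[k] + u(path[k]) * dt + np.sqrt(eps) * dw
    return path


def linear_drift(x):
    """Hypothetical test drift with a single point attractor at the origin."""
    return -x


# Without noise the path relaxes to the attractor; with noise it fluctuates around it.
quiet = euler_maruyama(linear_drift, [1.0, 0.5], eps=0.0, dt=0.01, n_steps=2000)
noisy = euler_maruyama(linear_drift, [0.0, 0.0], eps=0.1, dt=0.01, n_steps=100_000,
                       rng=np.random.default_rng(0))
```

For this linear test drift the stationary variance of each coordinate is close to $\epsilon/2$, which gives a simple sanity check. Escape events in a double well become exponentially rare as $\epsilon \to 0$, which is why direct simulation of this kind is impractical in the weak-noise regime.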



3 Symmetric Double Well Models 

As in two of our earlier papers on the stochastic exit problem PS| , ^J, we shall focus on 
two-dimensional 'double well' systems, with smooth drift field $u = (u_x, u_y)$ of the symmetric
form shown in Fig. 1. If $x = (x,y)$ is the two-dimensional state variable, $u_x(x,y)$ is taken
to be odd in $x$ and even in $y$, while for $u_y(x,y)$ the reverse is true.

Figure 1: The streamlines of a typical symmetric double well drift field, indicating the path
taken by the particle in the absence of noise. There are point attractors at $S = (x_s, 0)$ and
$S' = (-x_s, 0)$, and a saddle point at $(0,0)$.

There is assumed to be a linearly stable point attractor $S = (x_s, 0)$ whose domain of
attraction is the entire open right-half plane. By symmetry, its reflection $S' = (-x_s, 0)$
attracts the open left-half plane.
There is also assumed to be a single saddle, or hyperbolic point, on the $y$-axis separatrix
between the two domains of attraction. It must be at the origin, by symmetry. Nongradient 
drift fields with this topology arise in statistical and chemical physics, and also in theoretical 
biology, e.g., in stochastic competition models of population dynamics |JT . 



One expects that as $\epsilon \to 0$, exit from either of the two domains of attraction will occur
preferentially over the saddle. The drift field $u$ is assumed to have a nondegenerate
linearization at the saddle, so $\lambda_x = \partial u_x/\partial x\,(0,0) > 0$ and
$\lambda_y = \partial u_y/\partial y\,(0,0) < 0$. We shall
see that the character of the abovementioned bifurcation phenomenon depends strongly on
the quotient $\mu = |\lambda_y|/\lambda_x$.

A typical (and not necessarily gradient) symmetric double well drift field, which we have
used elsewhere for purposes of illustration and shall examine further below, is

$$u_x(x,y) = x - x^3 - \alpha x y^2, \qquad u_y(x,y) = -\mu(1 + x^2)\,y, \tag{6}$$

in which $\mu$ appears as a parameter. We shall call this drift field the 'standard' double well
model. For any choice of $\mu > 0$, its structure is that of Fig. 1, with $S = (x_s, 0) = (1,0)$.
It is not a gradient field unless the parameter $\alpha$ equals $\mu$. If $\alpha > 0$ it has a very significant
additional property, which we shall require of all our double well models. This is the property
that $\partial^2 u_x/\partial y^2\,(x,0)$, which by symmetry is an odd function of $x$, is strictly negative for all $x$
between $0$ and $x_s$. If this is the case, the drift from the saddle toward $S$ 'softens' as one
moves away from the $x$-axis. The off-axis softening, for the standard model, increases as $\alpha$ is
increased.
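
The properties of the standard model listed above can be checked numerically. The sketch below encodes the drift field (6) and its Jacobian; the function names are ours and purely illustrative.

```python
import numpy as np


def drift(x, y, alpha, mu):
    """Standard double well drift field, Eq. (6)."""
    ux = x - x**3 - alpha * x * y**2
    uy = -mu * (1.0 + x**2) * y
    return ux, uy


def jacobian(x, y, alpha, mu):
    """Analytic Jacobian [[dux/dx, dux/dy], [duy/dx, duy/dy]] of Eq. (6)."""
    return np.array([
        [1.0 - 3.0 * x**2 - alpha * y**2, -2.0 * alpha * x * y],
        [-2.0 * mu * x * y, -mu * (1.0 + x**2)],
    ])


def curl(x, y, alpha, mu):
    """duy/dx - dux/dy, equal to 2 (alpha - mu) x y; zero everywhere only if alpha == mu."""
    J = jacobian(x, y, alpha, mu)
    return J[1, 0] - J[0, 1]


def d2ux_dy2_on_axis(x, alpha):
    """Second y-derivative of ux on the x-axis: -2 alpha x (the off-axis softening)."""
    return -2.0 * alpha * x


alpha, mu = 4.0, 1.0                                         # illustrative parameter values
saddle_eigs = np.linalg.eigvals(jacobian(0.0, 0.0, alpha, mu))  # lambda_x = 1, lambda_y = -mu
well_eigs = np.linalg.eigvals(jacobian(1.0, 0.0, alpha, mu))    # both negative: S is stable
```

At the saddle the Jacobian is diagonal with entries $1$ and $-\mu$, so the parameter $\mu$ of (6) coincides with the quotient $|\lambda_y|/\lambda_x$ defined above.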

We remind the reader of our approach to the weak-noise limit of stochastically perturbed
dynamical systems. (We review the mathematical aspects in Sections 5.1 and 5.2.) Suppose
that such a system has a unique stationary probability density $p_0$, which satisfies the
time-independent Smoluchowski equation $\mathcal{L}^* p_0 = 0$. Typically, as the noise strength $\epsilon \to 0$,
$p_0$ takes on an asymptotic WKB form. In fact

$$p_0(x) \sim K(x)\exp[-W(x)/\epsilon], \qquad \epsilon \to 0, \tag{7}$$

for certain functions $W$ and $K$ whose smoothness properties we shall leave unspecified;
$K$, in particular, may have singularities. In any double well model, by convention $W = 0$
and $K = 1$ at $x = S$ and $S'$. Moreover, $W > 0$ at all points $x$ other than $S$ and $S'$. $W$ is
called the nonequilibrium potential of the model []IE |. If the drift $u$ equalled the negative
gradient of a potential $\Phi$, then $W$ would equal $2\Phi$, $K$ would reduce to a constant, and
the WKB form (7) would reduce to a Maxwell-Boltzmann distribution. For systems with
nongradient dynamics, the computation of $W$ and $K$ is more complicated.
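
The gradient case just mentioned can be verified in a few lines. The following sketch assumes $u = -\nabla\Phi$ with $\Phi$ normalized so that $\Phi(S) = 0$; the probability current $J$ is notation introduced here for convenience.

```latex
\begin{align*}
J &\equiv p_0\,u - (\epsilon/2)\nabla p_0 , \qquad \nabla\cdot J = 0
   \quad\text{(stationary form of Eq.~(4))},\\
p_0 &= \exp[-2\Phi/\epsilon] \;\Longrightarrow\;
   (\epsilon/2)\nabla p_0 = -(\nabla\Phi)\,p_0 = u\,p_0 ,\\
J &= 0 \quad\text{identically, so that } W = 2\Phi \text{ and } K \equiv 1 .
\end{align*}
```

The vanishing of the current itself, and not merely of its divergence, is the detailed balance property that nongradient drift fields generally lack.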

In general $W$ has an alternative interpretation as a classical action function. As we
review in Section 5.1, this is because the WKB approximation (7) is determined by a flow
field of 'classical' trajectories, or WKB characteristics, emanating from the attractors of
the deterministic dynamics (e.g., $S$ and $S'$). These classical trajectories (sometimes called
instanton trajectories || |28|, or optimal trajectories [11]) have a physical interpretation as
most probable fluctuational paths. In the double well case, the exponentially rare
fluctuations from $S$ (resp. $S'$) to any point $x$ in its domain of attraction become increasingly
concentrated around the classical trajectory extending from $S$ (resp. $S'$) to $x$. Equivalently,
the most probable 'prehistory' of any fluctuation passing through $x$ extends back toward
$S$ or $S'$ along this trajectory [11]. The trajectories are determined by a classical Lagrangian
(the Onsager-Machlup Lagrangian), and $W(x)$ is obtained by integrating this Lagrangian
along the classical trajectory terminating at $x$. $W(x)$ is interpreted as the rate at which
fluctuations to the neighborhood of $x$ are suppressed exponentially, as $\epsilon \to 0$.

In symmetric double well models the stationary density $p_0$ (and hence $W$) must be even
in $x$. In the $\epsilon \to 0$ limit the phenomenon of noise-activated hopping between the two wells
is governed by the closely related quasistationary density $p_1$, which is odd rather than even.
The quasistationary density is the next lowest lying (i.e., slowest decaying) eigenmode of
the Smoluchowski operator $\mathcal{L}^*$. For any choice of initial conditions the probability density
$p = p(x,t)$ necessarily satisfies

$$p(x,t) \sim p_0(x) + C\,p_1(x)\exp(-\lambda_1 t), \qquad t \to \infty \tag{8}$$

for some constant $C$, where $\lambda_1$ is the eigenvalue of $p_1$. The exponential decay of the
quasistationary eigenmode is interpreted as describing the equilibration of probability between
the two wells due to noise-activated hopping, or the absorption of probability on the
separatrix J7| . $p_1$ of course satisfies Dirichlet (absorbing) boundary conditions on the separatrix.
Its eigenvalue $\lambda_1 = \lambda_1(\epsilon)$ normally falls to zero exponentially as $\epsilon \to 0$. The exponentially
small splitting between the ground state eigenvalue $\lambda_0 = 0$ and the eigenvalue $\lambda_1$ is
analogous to the exponentially small splitting (as $\hbar \to 0$) between the ground state and first
excited state of a quantum-mechanical Hamiltonian with double well potential. Both are
WKB phenomena. In the $\epsilon \to 0$ limit, $\lambda_1$ is interpreted as the rate at which noise-activated
hopping takes place. Equivalently, it is a reciprocal MFPT.

The techniques reviewed in Sections 5.1 and 5.2 permit a computation of the $\epsilon \to 0$
asymptotics of the eigenvalue $\lambda_1$, and hence of the MFPT $\langle\tau\rangle$, in most symmetric
double well models. Our basic approach is similar to that of Kramers [22]. In the limit
of weak noise we approximate $p_1(x,y)$ by $p_0(x,y)\,{\rm sgn}(x)$ except in a 'boundary layer' of
width $O(\epsilon^{1/2})$ near the $x = 0$ separatrix, and compute $\lambda_1$ as the rate at which
probability is absorbed on the separatrix. Performing this computation requires the
construction of a boundary layer approximation to $p_1$, valid near the saddle, and matching to the
'outer' approximations on either side M. Normally, we find $\langle\tau\rangle \sim A\exp[+W(0,0)/\epsilon]$, where
$A \propto K(0,0)^{-1}$. So the asymptotic MFPT growth rate in the limit of weak noise is simply
$\Delta W = W(0,0) - W(S) = W(0,0)$, the height of the 'action barrier,' or activation barrier,
between the two wells. And the MFPT generally displays a pure exponential (Arrhenius)
growth, with an explicitly computable ($\epsilon$-independent) prefactor. We shall see, however,
that the bifurcation phenomenon may induce more complicated (non-Arrhenius) weak-noise
asymptotics for the MFPT.



4 The Bifurcation Phenomenon: Qualitative Features



We pointed out in Ref. [28] that a bifurcation phenomenon may occur in double well models
as their parameters are varied. Figure 2 displays the flow of instanton trajectories (i.e., most
probable weak-noise fluctuational paths) emanating from the stable point $S = (1,0)$ in the
standard model (6) with $\mu = 1$, at several values of the parameter $\alpha$. When $0 < \alpha < 4$ the
general picture resembles Fig. 2(a): the line segment from $(1,0)$ to the saddle $(0,0)$ is the
only instanton trajectory from $S$ to the saddle. This line segment is interpreted as the most
probable escape path (MPEP). In the weak-noise limit, the (exponentially rare) fluctuations
from the right-half plane to the left-half plane proceed preferentially along it. To leading
order, activation kinetics reduce to instanton dynamics.

As $\alpha$ is increased, there is a qualitative change, akin to a phase transition, in the behavior
of the instanton trajectories. This takes place at the critical value $\alpha = 4$, as shown in
Fig. 2(b). When $\alpha > 4$ as in Fig. 2(c), they focus at a point $(x_f, 0)$ on the $x$-axis, with $x_f > 0$.
$x_f$ converges to zero as $\alpha \to 4^+$, so one may speak of the focal point 'being born' at criticality,
and 'emerging from the saddle' as $\alpha$ is increased above its critical value. In geometrical optics
the focal point would be called a cusp. From it there extends a fold, or caustic (an envelope
of crossing trajectories, with $\Delta y \propto (\Delta x)^{3/2}$).

Figure 2: The flow field of outgoing instanton trajectories (i.e., most probable fluctuational
paths, in the weak-noise limit) emanating from the stable point $S = (1,0)$ of the standard
double well model (6). Here $\mu = 1$, and parts (a), (b), (c), (d) of the figure illustrate the
cases $\alpha = 1, 4, 5, 10$. The $\alpha = 4$ model is 'critical' in the bifurcation sense. Increasing $\alpha$
above 4 causes the instanton trajectories to focus, and the MPEP to bifurcate.

Each point in the sharp-tipped region within the fold is reachable from $S$ via three
instanton trajectories. Of these three trajectories, only the one(s) with minimum action
are 'physical,' and can be interpreted as most probable fluctuational paths. For example,
on-axis points $(x, 0)$ with $0 < x < x_f$ are reachable via an on-axis (straight) trajectory, and
via two additional symmetrically placed off-axis (curved) trajectories. Computation shows
that the off-axis trajectories have lesser action, and are dominant. The true ('least action')
MPEP's in Fig. 2(c) are accordingly the symmetrically placed pair of off-axis trajectories,
one above and one below the $x$-axis, that terminate on the saddle. Note that beyond the
cusp (i.e., at $x < x_f$), the physical action $W$ is no longer differentiable through the $x$-axis.
This nondifferentiability arises from different dominant off-axis trajectories being selected
as $y \to 0^+$ and $y \to 0^-$.

The transition at $\alpha = 4$ can be interpreted as a bifurcation of the MPEP, corresponding
to a sort of symmetry breaking. At larger values of $\alpha$, the drift field $u$ and the Langevin
equation (3) remain symmetric about the $x$-axis, but each of the two MPEP's is not. The
line segment from $S$ to the saddle, formerly the (unique) MPEP, in no way contributes to the
leading weak-noise asymptotics for escape. (It remains an extremum of the Onsager-Machlup
action functional, but is no longer the minimum.)

The occurrence of a bifurcation in the standard model at sufficiently high $\alpha$ (when $\mu = 1$,
at $\alpha = 4$) is due to the fact that by increasing $\alpha$, one softens the resistance to motion toward
the separatrix in the vicinity of the $x$-axis (though not on the $x$-axis itself). This enhances
the probability of escape trajectories that deviate from the axis. Of course it is only in the
limit, as $\epsilon \to 0$, that well-defined MPEP's appear. And the existence of a sharp, well-defined
transition when $\alpha$ equals some critical value $\alpha_c$ is not at all obvious!

When $\alpha$ is increased beyond $\alpha_c$, further bifurcations of the on-axis instanton trajectory
will occur. In Section 5.4 we explain how the critical values of $\alpha$ are determined by a Jacobi
equation, with a classical mechanical interpretation. It turns out that in the standard model
with $\mu = 1$ the $j$'th bifurcation occurs at $\alpha = \alpha_c^{(j)} = (j+1)^2$. Figure 2(d) shows the situation
at $\alpha = 10$, when a second focus $(x_f^{(2)}, 0)$ has emerged, with its own caustic. Beyond the first
focus $(x_f, 0)$ each point on the $x$-axis is reached from $S$ by three instanton trajectories;
beyond the second focus, each such point is reached by five. The MPEP's in Fig. 2(d),
however, remain the symmetrically placed pair of off-axis trajectories that terminate on the
saddle. Computation shows that the oscillatory trajectories from $S$ to the saddle arising
from the second, third, ... bifurcations have higher actions, and are accordingly not physical.

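
As a small bookkeeping aid, the sequence of critical values quoted above for $\mu = 1$ can be tabulated as follows. The function names are ours, and the rule $2j+1$ for the number of trajectories is an extrapolation of the counts three and five given in the text, not a result stated there in general.

```python
def critical_alpha(j):
    """j'th critical value of alpha in the standard model with mu = 1 (from the text)."""
    if j < 1:
        raise ValueError("bifurcations are numbered j = 1, 2, ...")
    return (j + 1) ** 2


def bifurcations_passed(alpha):
    """Number of critical values lying strictly below alpha."""
    j = 0
    while critical_alpha(j + 1) < alpha:
        j += 1
    return j


def on_axis_multiplicity(alpha):
    """Instanton trajectories from S reaching an on-axis point beyond the last focus.

    Assumption: the pattern 3, 5, ... quoted in the text continues as 2 j + 1.
    """
    return 2 * bifurcations_passed(alpha) + 1


for a in (1, 4, 5, 10):  # the four values of alpha shown in Fig. 2
    print(a, bifurcations_passed(a), on_axis_multiplicity(a))
```

The value $\alpha = 4$ is treated as not yet past the first bifurcation, since the focus only emerges from the saddle for $\alpha > 4$.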
That caustics can occur in the flow pattern of the most probable fluctuational paths has
been known for some time || [19|], but our Ref. [28] was the first to consider the effects on
exit phenomena. We shall see that what occurs at the first critical value of $\alpha$ has much in
common with a critical point characterizing a phase transition in a condensed matter system.
This is suggested by Fig. 3, which plots the activation barrier $\Delta W = W(0,0)$, as determined
by the true MPEP or MPEP's, as a function of $\alpha$ for the standard model with $\mu = 1$. (Recall
that $\Delta W = W(0,0)$ is the exponential growth rate of the MFPT as the noise strength tends
to zero.) $W(0,0)$ decreases above $\alpha = \alpha_c = 4$ as the bifurcating MPEP's move away from
the $x$-axis. The WKB prefactor $K(0,0)$ turns out to be singular at the bifurcation transition;
in Section 5.3 we note that it diverges as $\alpha \to \alpha_c^-$. As a consequence, in the standard model
at least, the weak-noise MFPT asymptotics at criticality cannot be of a pure Arrhenius form.

Figure 3: The activation barrier $\Delta W = W(0,0) - W(S) = W(0,0)$ between the two wells
of the standard double well model (6), as a function of the off-axis softening parameter $\alpha$.
Here $\mu = 1$. The lowering of the activation barrier beyond $\alpha = \alpha_c = 4$ is due to the
bifurcation of the MPEP, along which the action difference $\Delta W$ is computed.

There is in fact a set of critical exponents describing the behavior of the $\epsilon \to 0$ asymptotics
of the standard model as $\alpha$ tends to its ($\mu$-dependent) first critical value $\alpha_c = \alpha_c^{(1)}$, and as
$x \to 0$, $y \to 0$. It is a reasonable conjecture that behavior near criticality is universal in the
sense that it does not depend on the details of the stochastic model exhibiting the bifurcation
phenomenon. To analyse the critical behavior and demonstrate universality, in Sections |^
and H we begin the construction of a scaling theory of the bifurcation phenomenon. Our
treatment extends from the standard model (6) to any symmetric double well model with
a similar 'off-axis softening parameter' $\alpha$, and a first critical value $\alpha_c$. We first identify
the singular behavior, for any double well model at criticality, of the action $W$ and the
WKB prefactor $K$ at the saddle point. We then show that at criticality, the stationary
density $p_0$ and the quasistationary density $p_1$ may be approximated on an appropriate
($\epsilon$-dependent) length scale near the saddle point by certain 'diffraction functions,' which have
explicit integral representations. The technique for constructing these representations is due
to Maslov [32], and ultimately to Keller [20]. It was Maslov who first worked out, in the
context of wave fields, the diffraction functions that 'clothe' generic singularities other than
cusps and folds.

A very important discovery, from a mathematical point of view, will be that when $\alpha$ equals
the critical value $\alpha_c$ where the MPEP begins to bifurcate, the saddle point $(0,0)$ acquires
a certain nonzero singularity index. What this means is best understood by comparing
the singularity at the saddle (when $\alpha = \alpha_c$) with the cusp and fold singularities present
when $\alpha > \alpha_c$. The terminology of geometrical optics || is appropriate. The cusp at $(x_f, 0)$
is a structurally stable singularity (or catastrophe, in the language of Thom), with
codimension 2. The fold extending from it, though not 'physical' in the above least-action sense, is
a catastrophe of codimension 1. For points $x$ in the vicinity of the cusp, the WKB
approximation (7) for the value of the stationary density $p_0(x)$ breaks down. The proper treatment
of points near the cusp and the fold is similar to the short-wavelength treatment of wave
fields near caustics 0, [T2J. The cusp is said to have singularity index 1/4, and points on the
fold would (if it were physical) have singularity index 1/6. This means that at these singular
points the prefactor in the WKB approximation to $p_0$, which formally diverges, if properly
constructed would acquire a factor $\epsilon^{-1/4}$ (resp. $\epsilon^{-1/6}$). There is a non-WKB (but uniformly
valid) approximation to $p_0(x)$ in the vicinity of each such singular point, in terms of
canonical diffraction functions. We shall re-derive these facts in Section 7, in terms of scaling
functions.

Figure 4: The flow field of the instanton trajectories emanating from both stable fixed points
$S$ and $S'$, in a critical version of the standard double well model (6). This figure reveals
that at criticality, a two-sided caustic extends sideways from the saddle point. Although
a universal phenomenon, this caustic is nongeneric in the sense of singularity theory. Here
$\mu = 1$, and the parameter $\alpha$ is set equal to the corresponding first critical value $\alpha_c = 4$, as in
Fig. 2(b).

We shall show in Section 8 that the singularity index of the point singularity appearing
at the saddle, in critical models, depends in a universal way on $\mu$, i.e., on the ratio of the
eigenvalues of the linearization of the drift $u$ at the saddle. It turns out to equal $(\mu+1)/6$.
Moreover, the approximations to $p_0$ and $p_1$ near the saddle are given by non-canonical
diffraction functions. By using the non-canonical approximation to $p_1$ to compute the rate
at which probability is absorbed on the separatrix we shall quantify the universal
non-Arrhenius behavior of the weak-noise MFPT asymptotics. We shall show that in symmetric
double well models at criticality,

$$\langle\tau\rangle \sim {\rm const} \times \epsilon^{s}\exp[+\Delta W/\epsilon], \qquad \epsilon \to 0, \tag{9}$$

where $s = s(\mu) = (\mu+1)/6$ is the index of the singularity at the saddle. We shall also
derive scaling corrections, in the weak-noise limit, to the normal distribution of exit location
points near the saddle. The preceding results will hold for all $\mu$ satisfying $3/4 < \mu < 3$; the
weak-noise asymptotics of models with $\mu < 3/4$ and $\mu > 3$ are still under investigation.
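
The effect of the prefactor $\epsilon^{s}$ in (9) on an Arrhenius plot can be made concrete. Taking logarithms of (9) and differentiating with respect to $1/\epsilon$ gives a local slope $\Delta W - s\epsilon$ rather than the constant $\Delta W$ of a pure Arrhenius law. The sketch below checks this numerically; the function names, the finite-difference step, and the sample values of $\Delta W$ and $\epsilon$ are assumptions made for illustration.

```python
import numpy as np


def singularity_index(mu):
    """s(mu) = (mu + 1) / 6, stated in the text for 3/4 < mu < 3."""
    if not 0.75 < mu < 3.0:
        raise ValueError("s(mu) is given in the text only for 3/4 < mu < 3")
    return (mu + 1.0) / 6.0


def log_mfpt(eps, delta_w, mu, log_const=0.0):
    """Logarithm of the right-hand side of Eq. (9); log_const is the unknown constant."""
    return log_const + singularity_index(mu) * np.log(eps) + delta_w / eps


def arrhenius_slope(eps, delta_w, mu, h=1e-6):
    """Central-difference estimate of d(log MFPT) / d(1/eps) at noise strength eps."""
    beta = 1.0 / eps
    return (log_mfpt(1.0 / (beta + h), delta_w, mu)
            - log_mfpt(1.0 / (beta - h), delta_w, mu)) / (2.0 * h)


eps, delta_w, mu = 0.05, 0.5, 1.0          # illustrative values only
slope = arrhenius_slope(eps, delta_w, mu)  # compare with delta_w - s * eps
```

Since the correction $s\epsilon$ vanishes only as $\epsilon \to 0$, the plot of $\log\langle\tau\rangle$ against $1/\epsilon$ approaches its asymptotic slope slowly, bending by an amount logarithmic in $1/\epsilon$.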

The point singularity appearing in critical models at the saddle point, which may be
termed a nascent cusp, is nongeneric. It is not a member of the well known family of
singularities that includes folds, cusps, swallowtails, etc. This becomes clear if one plots the
flow field of the instanton trajectories emanating from both $S$ and $S'$ in the standard model
at criticality ($\mu$ varying, and $\alpha$ set equal to its $\mu$-dependent first critical value). At least
when $3/4 < \mu < 3/2$, one finds that at criticality a two-sided caustic extends transversally
from the saddle point itself. (Cf. Dykman et al. [12].) Figure 4, which is an extended version
of Fig. 2(b), shows the flow field when $\mu = 1$ and $\alpha = \alpha_c = 4$. The caustic is clearly visible.
It is not 'physical,' since it is formed by high-action instanton trajectories that have crossed
the separatrix. But the 'nascent' cusp is clearly a cusp in its own right, of an unusual sort.
Numerically one finds that the two-sided caustic extending from it is located at

$$|x| \sim {\rm const} \times |y|^{(3/2-\mu)^{-1}}, \qquad y \to 0. \tag{10}$$

A conventional (generic) caustic would have an exponent of 3/2. The continuously varying
exponent $(3/2-\mu)^{-1}$, which turns out to be universal and which we shall derive in Section 8
from our scaling theory, signals that the two-sided caustic is nongeneric. The nascent cusp
from which it extends is itself nongeneric in the sense of singularity theory.
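
The exponent in (10) is easy to tabulate, and an exponent of this kind is typically estimated from numerically located caustic points by a log-log fit. The sketch below shows both steps. The fitting routine and the synthetic points are ours and serve only to illustrate the procedure; they are not data from this paper.

```python
import numpy as np


def caustic_exponent(mu):
    """Exponent in |x| ~ const * |y|**exponent, Eq. (10), for 3/4 < mu < 3/2."""
    if not 0.75 < mu < 1.5:
        raise ValueError("Eq. (10) is stated in the text only for 3/4 < mu < 3/2")
    return 1.0 / (1.5 - mu)


def fit_exponent(y, x):
    """Least-squares slope of log|x| against log|y|."""
    slope, _intercept = np.polyfit(np.log(np.abs(y)), np.log(np.abs(x)), 1)
    return slope


# Synthetic check: made-up points lying exactly on a power law with an arbitrary constant.
y = np.logspace(-4, -1, 20)
x = 0.7 * y ** caustic_exponent(1.0)
fitted = fit_exponent(y, x)
```

At $\mu = 1$ the exponent equals 2, and the generic value 3/2 is recovered only at the single value $\mu = 5/6$.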

As Berry 0] has emphasized, nongeneric singularities arise from catastrophes of infinite 
codimension. It is remarkable that a singularity of such complexity is a universal feature of 
singly parametrized symmetric double well models with non-gradient dynamics. 

5 Quantitative Semiclassical Asymptotics 

We now begin a quantitative treatment of the weak-noise asymptotics for escape. We first
recast our earlier results in a form that facilitates the analysis of singularities. In Section 5.1
we discuss geometric aspects of the WKB approximation, and in Section 5.2 we discuss our
matched asymptotic approximations technique for computing MFPT asymptotics. In
Section 5.3 we use the standard model (6) to illustrate the nature of the nascent cusp appearing
at the saddle at criticality, and the ways in which bifurcation can be viewed as a phase
transition. In Section 5.4 we explore the bifurcation phenomenon from a classical mechanical
point of view, and relate it to the appearance of a transverse soft mode. We explain how its
appearance is governed by a Jacobi equation, and how this equation determines whether or
not a given double well model is at criticality.

5.1 The WKB Approximation and Classical Mechanics 

The time-independent forward Smoluchowski equation $\mathcal{L}^* p_0 = 0$ may be written as

$$H(x, -\epsilon\nabla)\,p_0 = 0, \tag{11}$$

where

$$H(x,p) = p^2/2 + u(x)\cdot p \tag{12}$$

is the so-called Wentzell-Freidlin Hamiltonian [Q, whose dual is the Onsager-Machlup
Lagrangian [33]

$$L(x,\dot{x}) = |\dot{x} - u(x)|^2/2. \tag{13}$$

In equation (11) we have adopted an operator ordering convention according to which the
action of $\nabla$ precedes that of $x$.
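
As a worked illustration of (12) and (13), consider the straight on-axis path of the standard model (6) from $S = (1,0)$ toward the saddle. On the axis, setting $H = 0$ with $p_y = 0$ gives the nonzero root $p_x = -2u_x$, so $\dot{x} = p_x + u_x = -u_x$, the Lagrangian (13) equals $2u_x^2$, and the action accumulated from $S$ to $(x_{\rm end}, 0)$ is $2\int_{x_{\rm end}}^{x_s} u_x\,dx$. The sketch below evaluates this by the trapezoid rule; the function names and grid size are ours. The result is the action barrier only when the on-axis path is the MPEP.

```python
import numpy as np


def ux_on_axis(x):
    """x-component of the standard drift (6) on the x-axis (y = 0)."""
    return x - x**3


def on_axis_action(x_end, x_s=1.0, n=2001):
    """Action of the straight on-axis instanton path from (x_s, 0) to (x_end, 0).

    Uses W = 2 * integral of ux from x_end to x_s, evaluated by the composite
    trapezoid rule on n equally spaced points.
    """
    x = np.linspace(x_end, x_s, n)
    f = 2.0 * ux_on_axis(x)
    return float(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(x)))


barrier_on_axis = on_axis_action(0.0)  # on-axis action from S to the saddle
```

Because $y = 0$ along this path, the result does not depend on $\alpha$ or $\mu$. The exact value of the integral is $1/2$, which the quadrature reproduces to within its discretization error.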

In the weak-noise ($\epsilon \to 0$) limit the stationary density $p_0$ and the quasistationary
density $p_1$ are given in the interior of each well by a WKB, or semiclassical form. A full WKB
expansion for $p_0$ would be of the form

$$p_0(x) \sim [K^{(0)}(x) + \epsilon K^{(1)}(x) + \cdots]\exp[-W(x)/\epsilon], \qquad \epsilon \to 0, \tag{14}$$

as in geometrical optics. By substituting this formal series into (11) and examining the
coefficients of each power of $\epsilon$, one obtains equations for $W$ and the $K^{(m)}$. That is what
we shall do, though we shall work only to leading order: our WKB Ansatz will be
$p_0(x) \sim K(x)\exp[-W(x)/\epsilon]$. Notice that since the eigenvalue $\lambda_1 = \lambda_1(\epsilon)$ of $p_1$ is exponentially small
as $\epsilon \to 0$, the asymptotic expansions (in powers of $\epsilon$) for $p_1$ and $p_0$ will be the same. To see
the difference between them, which is significant only near the separatrix between the two
wells, one would have to go 'beyond all orders' in the WKB expansion.

The eikonal equation for $W$ is the time-independent Hamilton-Jacobi equation

$$H(\mathbf{x}, \nabla W) = 0, \tag{15}$$

so that $W$ is a classical action at zero energy. For any point $\mathbf{x}$ in either well, it may be
computed by integrating the Lagrangian along the zero-energy classical trajectory extending
from $S$ (resp. $S'$) to $\mathbf{x}$. Each such trajectory, which satisfies the Euler-Lagrange equations,
is interpreted as a most probable fluctuational path in the $\epsilon \to 0$ limit. These trajectories
are the 'instanton trajectories' of the last section; the term is justified by analogy with the
semiclassical limit in quantum mechanics and quantum field theory 0]. In the language of
Gutzwiller [Tj|, the points at which the instanton trajectories focus would be called
zero-energy conjugate points.

It is convenient to work in the Hamiltonian picture, according to which the classical
trajectories of interest lie on a zero-energy surface in a nonphysical phase space, coordinatized
by position $\mathbf{x}$ and momentum $\mathbf{p}$. The flow in this phase space ($2n$-dimensional, if
configuration space is $n$-dimensional) is determined by Hamilton's equations and the
Wentzell-Freidlin Hamiltonian. From this point of view the instanton trajectories of Figs. ^ and ^
are mere images of phase-space trajectories, projected 'down' to configuration space by the
map $(\mathbf{x},\mathbf{p}) \mapsto \mathbf{x}$. The phase-space trajectories emanate from $(S, \mathbf{0})$
(resp. $(S', \mathbf{0})$). In WKB theory the projected trajectories are traditionally called
characteristics, and the phase space trajectories bicharacteristics. Characteristics may
intersect, as in Figs. 2(c) and 2(d), but bicharacteristics may not.

It is easy to verify, using Hamilton's equations, that $(S, \mathbf{0})$ and $(S', \mathbf{0})$ are hyperbolic
fixed points of the Hamiltonian flow. The unstable manifold of $(S, \mathbf{0})$, for example, comprises
all points $(\mathbf{x},\mathbf{p})$ that lie on one of the bicharacteristics emanating from $(S, \mathbf{0})$. The unstable
manifolds of $(S, \mathbf{0})$ and $(S', \mathbf{0})$ are Lagrangian [25]: they are invariant under the Hamiltonian
flow. By the term 'Lagrangian manifold' we shall refer to either of these two unstable
manifolds, or their union. We denote by $\mathcal{M}$ this union, i.e., the set of all points $(\mathbf{x},\mathbf{p})$ that
lie on a bicharacteristic emanating from either $(S, \mathbf{0})$ or $(S', \mathbf{0})$. If configuration space is
$n$-dimensional, $\mathcal{M}$ will be an $n$-dimensional manifold.

Each point $P = (\mathbf{x}, \mathbf{p})$ on $\mathcal{M}$ has a value for the zero-energy action $W$ associated with it,
computed by

$$W(P) = \int \mathbf{p}\cdot d\mathbf{x}, \tag{16}$$

the line integral being taken along the bicharacteristic terminating at $P$. If due to intersecting
characteristics, or the crossing of characteristics from one well to the other, there are several
manifold points $P_i = (\mathbf{x}, \mathbf{p}^{(i)})$ 'above' some point $\mathbf{x}$, then $W(\mathbf{x})$ and its gradient
$\mathbf{p} = \mathbf{p}(\mathbf{x})$ will in a mathematical sense be multivalued. As a function of $\mathbf{x}$, $W$ may in fact have
branch points, branch lines (cuts), etc. But the physical action $W(\mathbf{x})$ appearing in the
WKB approximation will be single-valued: it will equal the minimum of the values $W(P_i)$
at the manifold points above $\mathbf{x}$. This 'least action' computation determines which instanton
trajectories are physical.
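
The construction above (follow a bicharacteristic out of $(S, \mathbf{0})$ and accumulate $W = \int \mathbf{p}\cdot d\mathbf{x}$ along it) can be sketched numerically. The following is a minimal illustration, not the computation used for the figures. It assumes a drift $\mathbf{u} = (x - x^3 - \alpha x y^2,\ -\mu(1+x^2)y)$, assembled from the coefficient functions (35)-(37) under the assumption that no higher-order terms are present, and it takes $\alpha = \mu = 1$, for which this drift is the gradient of $-U$ with $U = -x^2/2 + x^4/4 + (1+x^2)y^2/2$, so that $W = 2[U - U(S)]$ is available as a check. The function names, the Runge-Kutta integrator, the starting radius and the stopping box are all illustrative choices.

```python
import math

ALPHA, MU = 1.0, 1.0  # assumed parameters; alpha = mu makes the assumed drift a gradient


def drift(x, y):
    """Assumed drift u = (x - x^3 - alpha*x*y^2, -mu*(1 + x^2)*y)."""
    return x - x**3 - ALPHA * x * y * y, -MU * (1.0 + x * x) * y


def hamiltonian(x, y, px, py):
    """Wentzell-Freidlin Hamiltonian H = |p|^2/2 + u.p, equation (12)."""
    ux, uy = drift(x, y)
    return 0.5 * (px * px + py * py) + ux * px + uy * py


def rhs(state):
    """Hamilton's equations for (x, y, px, py), plus dW/dt = p . dx/dt from (16)."""
    x, y, px, py, _ = state
    ux, uy = drift(x, y)
    xdot, ydot = px + ux, py + uy
    # dp_i/dt = -sum_j (du_j/dx_i) p_j
    pxdot = -((1.0 - 3.0 * x * x - ALPHA * y * y) * px - 2.0 * MU * x * y * py)
    pydot = -(-2.0 * ALPHA * x * y * px - MU * (1.0 + x * x) * py)
    return (xdot, ydot, pxdot, pydot, px * xdot + py * ydot)


def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step for a tuple-valued state."""
    def shifted(k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))
    k1 = f(state)
    k2 = f(shifted(k1, 0.5 * dt))
    k3 = f(shifted(k2, 0.5 * dt))
    k4 = f(shifted(k3, dt))
    return tuple(s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def bicharacteristic(theta, r0=1e-3, dt=1e-3, max_steps=20000):
    """Follow one bicharacteristic leaving (S, 0), with S = (1, 0), in direction theta."""
    dx, dy = r0 * math.cos(theta), r0 * math.sin(theta)
    # Linearized unstable manifold at S: W ~ 2 dx^2 + 2 mu dy^2, and p = grad W.
    state = (1.0 + dx, dy, 4.0 * dx, 4.0 * MU * dy, 2.0 * dx * dx + 2.0 * MU * dy * dy)
    path = [state]
    for _ in range(max_steps):
        state = rk4_step(rhs, state, dt)
        if state[0] <= 0.0 or abs(state[1]) > 1.0:  # reached the separatrix or left the box
            break
        path.append(state)
    return path
```

Along any path returned by `bicharacteristic`, `hamiltonian` should stay close to zero, and the last component of each state is the accumulated action. Shooting many directions `theta` and keeping, at each configuration-space point, the smallest action among the trajectories that pass through it is one way to realize the 'least action' rule numerically.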



The WKB prefactor $K$ satisfies an easily derived transport equation. (Cf. Talkner [38].)
If one uses the fact that $\dot{\mathbf{x}} = \mathbf{p} + \mathbf{u}(\mathbf{x})$ (which is one of Hamilton's equations), the transport
equation takes on the comparatively simple form

$$\dot K = -[\nabla\cdot\mathbf{u} + \nabla^2 W/2]\,K, \tag{17}$$

the time derivative referring to instanton transit time, i.e., to motion along a characteristic
or bicharacteristic. Similarly to $W$, $K$ may be regarded as a function on $\mathcal{M}$ rather than on
configuration space. Integration of equation (17) requires knowledge of the second spatial
derivatives of $W$ along the characteristic. But $(\partial^2 W/\partial x_i\partial x_j)(\mathbf{x})$ equals
$(\partial p_i/\partial x_j)(\mathbf{x})$, which is a measure of the 'slope' of the manifold above the point $\mathbf{x}$.
By differentiating Hamilton's equations it is easy to show that the Hessian matrix $Z = (Z_{ij})$
whose elements are the partial slopes $\partial p_i/\partial x_j$ satisfies the matrix Riccati equation

$$\dot Z = -Z^2 - ZB - B^{t}Z - \sum_l p_l\,Y^{(l)} \tag{18}$$

along any characteristic. (Cf. Ludwig |2?J.) Here $B = (\partial u_i/\partial x_j)$ and
$Y^{(l)} = (\partial^2 u_l/\partial x_i\partial x_j)$ are auxiliary matrices. Since $\nabla^2 W = \operatorname{tr} Z$,
the computation of $K$ by numerical integration is straightforward.

It is interesting to compare these results with those of Littlejohn [25] on the WKB
prefactor for the solutions of the Schrodinger equation in the semiclassical ($\hbar \to 0$) limit. He
introduces a Lagrangian manifold, and a similar integration along characteristics. But because
he analyses the time-dependent Schrodinger equation, he finds that the transport equation
for his analogue of $K$ can be integrated explicitly, yielding a Van Vleck determinant. Matters
are not so simple in the time-independent case, for the Schrodinger equation as well as for
the Smoluchowski equation. Our WKB analysis of the weak-noise limit of the stationary
density actually has more in common with the work of Gutzwiller on the semiclassical
approximation of fixed-energy quantum-mechanical Green's functions [17, 18], than it does
with the semiclassical approximation of time-dependent quantum-mechanical propagators.
The prefactor $K$ is analogous to the prefactor of a semiclassical Green's function (at fixed
energy). It can in fact be related to the density of bicharacteristics on the Lagrangian
manifold. This resembles Gutzwiller's interpretation of the prefactor of a semiclassical Green's
function in terms of the density of classical trajectories on an energy surface [18].



5.2 Matched Asymptotic Approximations and the MFPT 

We now specialize to two-dimensional double well models with the structure of Fig. 1. On
account of symmetry and smoothness we may expand the drift $\mathbf{u} = (u_x, u_y)$ thus:

$$u_x(x,y) = v_0(x) + v_2(x)\,y^2 + \cdots, \qquad u_y(x,y) = u_1(x)\,y + u_3(x)\,y^3 + \cdots. \tag{19}$$

By assumption $v_0(x) > 0$ for all $x$ between $0$ and $x_s$, and $u_1(x) < 0$ for all $x$ between $0$ and $x_s$
inclusive. If the symmetry through the axis is unbroken, $W$ and $K$ (both of them computed
by integration along instanton trajectories emanating from $S$) will have similar expansions

$$W(x,y) = w_0(x) + w_2(x)\,y^2/2! + \cdots \tag{20}$$
$$K(x,y) = k_0(x) + k_2(x)\,y^2/2! + \cdots. \tag{21}$$

Here $w_{2m}(x) = \partial^{2m}W/\partial y^{2m}(x, 0)$ and $k_{2m}(x) = \partial^{2m}K/\partial y^{2m}(x, 0)$. Since $W$ can be
viewed as a classical action, the functions $w_{2m}$ can be expressed in terms of the momentum
$\mathbf{p} = \mathbf{p}(\mathbf{x})$ of the instanton trajectories passing through near-axis points $\mathbf{x}$. For example,
$w_0'(x) = p_x(x, 0)$ and $w_2(x) = \partial p_y/\partial y(x, 0)$. Substituting the WKB Ansatz into the
Smoluchowski equation $\mathcal{L}^*\rho_0 = 0$, and examining the coefficients of each power of $\epsilon$ and $y$,
will yield equations for the various coefficient functions in (20) and (21). One finds in particular
that $w_0' = p_x = -2v_0$, or

$$w_0(x) = 2\int_x^{x_s} v_0(x')\,dx'. \tag{22}$$

Therefore the Hamilton equation $\dot x = p_x + v_0(x)$, which follows from the Wentzell-Freidlin
Hamiltonian (12), implies that $\dot x$ must equal $-v_0(x)$ at all points between $S$ and the saddle.
The instanton trajectory on the $x$-axis moves with a speed equal to the local value of the
drift speed, but in the direction opposite to the drift.
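
As a small worked example of the on-axis action (22), the sketch below evaluates $w_0(x) = 2\int_x^{x_s} v_0(x')\,dx'$ by Simpson's rule. It assumes the on-axis drift $v_0(x) = x - x^3$ of the standard model, equation (35), with $x_s = 1$; the function names and the quadrature are illustrative choices. For this $v_0$ the integral can also be done by hand, giving $w_0(x) = 1/2 - x^2 + x^4/2$ and hence an action barrier $\Delta W = w_0(0) = 1/2$.

```python
def v0(x):
    """On-axis drift speed; the form x - x^3 is that of equation (35), assumed here."""
    return x - x**3


def w0(x, x_s=1.0, n=2000):
    """On-axis action w0(x) = 2 * integral from x to x_s of v0, by Simpson's rule (n even)."""
    h = (x_s - x) / n
    total = v0(x) + v0(x_s)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * v0(x + i * h)
    return 2.0 * total * h / 3.0


barrier = w0(0.0)  # action barrier Delta W = W(0, 0)
```

Differentiating `w0` numerically recovers $w_0' = -2v_0$, i.e., the on-axis momentum $p_x$ is minus twice the drift speed.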
Examining coefficients also yields the two equations

$$\dot k_0 = -[u_1 + w_2/2]\,k_0 \tag{23}$$
$$\dot w_2 = -w_2^2 - 2u_1 w_2 + 4v_0 v_2 \tag{24}$$

where we have changed the independent variable from $x$ to $t$ by writing $\dot k_0$ for $-v_0 k_0'$, and
$\dot w_2$ for $-v_0 w_2'$. Equations (23) and (24) could equally well be deduced from (17) and (18). For
later reference we note that

$$\dot w_4 = -v_0 w_4' = -4(w_2 + u_1)\,w_4 - 3\left[(w_2')^2 + 4v_2 w_2' + 8u_3 w_2\right] + 48\,v_0 v_4 \tag{25}$$

is the equation satisfied by the fourth derivative $w_4 = \partial^4 W/\partial y^4 = \partial^3 p_y/\partial y^3$ on the $x$-axis.

The physical interpretation of the functions $k_0$ and $w_2$ is straightforward. The WKB
Ansatz implies that

$$\rho_0(x,y) \sim k_0(x)\exp\left\{-\left[w_0(x) + w_2(x)\frac{y^2}{2!}\right]\Big/\epsilon\right\}, \qquad \epsilon \to 0. \tag{26}$$

So when $\epsilon$ is small, $w_2$ governs the small transverse fluctuations about the $x$-axis. At any
time when the system state $\mathbf{x}$ has fluctuated leftward from $x_s$ to $x$, the distribution of the
transverse component $y$ will (provided that $w_2(x) > 0$) be approximately Gaussian, with
variance $\sim \epsilon/w_2(x)$. Of course such fluctuations are exponentially rare, on account of the
$\exp[-w_0(x)/\epsilon]$ factor.

The Riccati equation (24) therefore gives the position dependence of the width of the
'WKB tube' of probability density surrounding the MPEP, when this MPEP is in fact the
line segment between $S$ and the saddle. Moreover this equation captures the essence of
the bifurcation phenomenon, as we shall see in Section 5.4. For the moment we note only
that it may readily be integrated from $t = -\infty$ (when the instanton trajectory formally
emerges from $S$) to $t = +\infty$ (when the trajectory, obeying $\dot x = -v_0(x)$, reaches the saddle).
Since $v_0(x(t)) \to 0$ as $t \to \pm\infty$, we see from (24) that $w_2$ must converge as $t \to +\infty$
(resp. $t \to -\infty$) to one of the two zeroes of the quadratic polynomial

$$-w_2^2 - 2u_1 w_2 = -w_2\,(w_2 - 2|u_1|), \tag{27}$$

where $u_1$ signifies $u_1(0)$ (resp. $u_1(x_s)$). On physical grounds one expects that usually
('generically') the WKB tube will have a finite variance at both endpoints, i.e., as $t \to -\infty$
and $x \to x_s$, and as $t \to +\infty$ and $x \to 0$. So $w_2(x_s)$ should equal $2|u_1(x_s)|$, and $w_2(0)$ should
equal $2|u_1(0)|$.

If these endpoint ('turning point') conditions hold, it is easy to match the tube
approximation (26) to auxiliary, non-WKB approximations valid near the endpoints: the stable
points and the saddle. On physical grounds, $\rho_0$ and $\rho_1$ may be approximated on the $O(\epsilon^{1/2})$
lengthscale near $S$ by a Gaussian function of the system state $\mathbf{x}$. Let us write $\nu_x$ and $\nu_y$ for
$\partial u_x/\partial x(S)$ and $\partial u_y/\partial y(S)$, the two (negative) eigenvalues of the linearization of the
drift $\mathbf{u}$ at $S$. Then

$$\rho_0(x,y) \sim \text{const}\times e^{-|\nu_x|(x - x_s)^2/\epsilon}\,e^{-|\nu_y|y^2/\epsilon} \tag{28}$$

near $S$, the same being true of $\rho_1$. Since $\nu_y = u_1(x_s)$, this will match to the tube
approximation if $w_2(x_s) = 2|u_1(x_s)|$. Similarly, on the $O(\epsilon^{1/2})$ lengthscale near the saddle, $\rho_0$ may
be approximated by the inverted Gaussian

$$\rho_0(x,y) \sim \text{const}\times e^{+\lambda_x x^2/\epsilon}\,e^{-|\lambda_y|y^2/\epsilon}. \tag{29}$$

Since $\lambda_y = u_1(0)$, the tube approximation will match to (29) if $w_2(0) = 2|u_1(0)|$ and $k_0(0)$ is
finite and nonzero. It is easy to verify that the approximations (28) and (29) satisfy the
time-independent Smoluchowski equation on the $O(\epsilon^{1/2})$ lengthscale near their respective
turning points.

The appropriate (generic) approximation to the quasistationary density $\rho_1$ near the saddle
is slightly more complicated; it is an error function approximation of the sort first used by
Kramers [2]. We have

$$\rho_1(x,y) \sim \text{const}\times\left[\epsilon^{-1/2}\int_0 \exp\left[-(x p_x + p_x^2/4\lambda_x)/\epsilon\right]dp_x\right] e^{-|\lambda_y|y^2/\epsilon} \tag{30}$$
$$\phantom{\rho_1(x,y)} = \text{const}\times\operatorname{erf}\left(\lambda_x^{1/2}x/\epsilon^{1/2}\right) e^{+\lambda_x x^2/\epsilon}\,e^{-|\lambda_y|y^2/\epsilon} \tag{31}$$

on the $O(\epsilon^{1/2})$ lengthscale. This 'boundary layer' approximation agrees with the inverted
Gaussian approximation (29) in the far field, i.e., as $x/\epsilon^{1/2} \to +\infty$. So under the same
conditions, the tube approximation (26) will match to it.

We have now approximated $\rho_1(\mathbf{x})$ at all points $\mathbf{x}$ in the vicinity of the line segment joining
$S$ and the saddle. We must emphasize that the validity of this procedure depends on two
assumptions:

• That the physical values of $W(\mathbf{x})$ and $K(\mathbf{x})$ at all points $\mathbf{x}$ along the axis arise from
integration along the on-axis instanton trajectory extending from $S$ to $\mathbf{x}$.

• That the WKB tube surrounding the axis is well behaved as the saddle is approached,
so that the error function approximation to the quasistationary density is valid near
the saddle. This requires that $w_2 \to 2|u_1(0)|$, and that $k_0$ tend to a finite, nonzero
limit.

The first assumption breaks down when the MPEP has bifurcated, and we shall see that the
second assumption breaks down at the onset of bifurcation. But if both assumptions hold,
it is easy to compute the weak-noise asymptotics of the quasistationary eigenvalue $\lambda_1$ and
its asymptotic reciprocal, the MFPT $\langle\tau\rangle$. The time-dependent equation $\dot\rho = -\mathcal{L}^*\rho$ may be
written as

$$\dot\rho + \nabla\cdot[-(\epsilon/2)\nabla\rho + \rho\,\mathbf{u}] = 0. \tag{32}$$

Equation (32) is a continuity equation, and $\mathbf{j} = -(\epsilon/2)\nabla\rho + \rho\,\mathbf{u}$ can be viewed as a probability
current density. Since $\lambda_1$ is the decay rate of the eigenmode $\rho_1$, it may be computed as the
rate at which probability is absorbed on the separatrix [22, R3|. Necessarily

$$\lambda_1 = \int_{-\infty}^{\infty}\left[-j_x(0,y)\right]dy \Big/ \int_0^{\infty}\!\!\int_{-\infty}^{\infty}\rho_1(x,y)\,dy\,dx \tag{33}$$

where $\mathbf{j} = (j_x, j_y)$ is computed from $\rho_1$. The numerator (an absorption rate) is computed
from (31), and the normalization factor in the denominator from the Gaussian
approximation (28). If the constant prefactors of these two approximations are chosen to ensure
consistency with the intermediate WKB tube approximation (26), the quotient will acquire
a factor $k_0(0)\exp[-w_0(0)/\epsilon]$, i.e., $K(0,0)\exp[-W(0,0)/\epsilon]$.
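
The quotient (33) can be made concrete with a short sketch. It assumes the boundary-layer form (31) with prefactor constant $K(0,0)\exp[-\Delta W/\epsilon]$ (fixed by matching to the tube approximation, since $\operatorname{erf} \to 1$ in the far field), the Gaussian (28) with unit prefactor (i.e., the normalization $K(S) = 1$ and $W(S) = 0$, which is an assumption of this sketch), and that the two integrals are done in closed form. The function name and argument names are illustrative.

```python
import math


def eyring_rate(eps, K_saddle, dW, lam_x, lam_y, nu_x, nu_y):
    """Weak-noise estimate of lambda_1 as absorbed flux divided by normalization, as in (33)."""
    # Numerator: -j_x(0, y) = (eps/2) d(rho_1)/dx at x = 0, integrated over y, with
    # rho_1 = C erf(sqrt(lam_x/eps) x) exp(+lam_x x^2/eps) exp(-|lam_y| y^2/eps).
    C = K_saddle * math.exp(-dW / eps)
    flux = C * eps * math.sqrt(lam_x / abs(lam_y))
    # Denominator: integral over the well of exp(-|nu_x|(x-x_s)^2/eps - |nu_y| y^2/eps).
    norm = math.pi * eps / math.sqrt(abs(nu_x) * abs(nu_y))
    return flux / norm


# Illustrative values: linearizing (35) and (37) with mu = 1 gives lam_x = 1, lam_y = -1
# at the saddle and nu_x = nu_y = -2 at S; (22) with (35) gives Delta W = 1/2.
rate = eyring_rate(eps=0.1, K_saddle=1.0, dW=0.5, lam_x=1.0, lam_y=-1.0, nu_x=-2.0, nu_y=-2.0)
```

Under these assumptions the powers of $\epsilon$ cancel between numerator and denominator, which is why the resulting prefactor is independent of $\epsilon$ and the growth of the MFPT is purely Arrhenius.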

This computation, if carried through, yields a so-called Eyring formula for the weak-noise
asymptotics of the quasistationary eigenvalue, i.e., the weak-noise asymptotics of the rate
of noise-activated hopping

$$\lambda_1(\epsilon) \sim K(0,0)\,\frac{\sqrt{\lambda_x|\nu_x|}}{\pi}\sqrt{\frac{|\nu_y|}{|\lambda_y|}}\,\exp[-\Delta W/\epsilon], \qquad \epsilon \to 0. \tag{34}$$

Here the presence of the 'frequency factor' $K(0,0)$ is attributable to the non-gradient
dynamics; it will equal unity if the drift $\mathbf{u}$ is a gradient. The formula is otherwise familiar.
Since $\langle\tau\rangle \sim \lambda_1^{-1}$ as $\epsilon \to 0$, this formula predicts a pure Arrhenius growth of the MFPT in
the weak-noise limit. But as noted, this conclusion depends crucially on the validity of the
Kramers-type error function approximation to the quasistationary density near the saddle.
This approximation will prove not to be valid in double well models undergoing a bifurcation.

5.3 Indications of a Phase Transition

We shall now explain how the bifurcation transition displays characteristic features of a phase
transition, such as power-law divergences governed by critical exponents. We begin by using
the standard double well model @, and the transport equations of the last section, to reveal
the nature of the 'nascent cusp' singularity appearing at the saddle point, at criticality.

For the standard model, the stable point $S$ is located at $x = x_s = 1$, and the coefficient
functions (drift velocity derivatives) in the transport equations are of the form

$$v_0(x) = u_x(x, 0) = x - x^3 \tag{35}$$
$$2v_2(x) = \partial^2 u_x/\partial y^2(x, 0) = -2\alpha x \tag{36}$$
$$u_1(x) = \partial u_y/\partial y(x, 0) = -\mu(1 + x^2). \tag{37}$$

These may be substituted into the Riccati equation (24) for the transverse second derivative
$w_2(x) = \partial^2 W/\partial y^2(x, 0)$, and the equation numerically integrated. As noted, the appropriate
initial condition is $w_2(x = x_s) = 2|u_1(x_s)|$, i.e., $w_2(x = 1) = 4\mu$. Consider the case $\mu = 1$
(the subject of Fig. 0), in particular. One finds for all $\alpha$ in the range $0 < \alpha < 4$ that $w_2$ is
positive on the line segment between $x = x_s$ and the saddle at $x = 0$. Since the WKB tube
centered on the axis, which is formed by small transverse fluctuations about the MPEP, has
variance $\sim \epsilon/w_2(x)$, this positivity implies that the tube is everywhere well-defined. One also
finds that $w_2 \to 2$, i.e., $w_2 \to 2|u_1(0)|$, as the saddle is approached. Moreover, by integrating
the transport equation (23) one finds that $k_0(x) = K(x, 0)$ tends to a finite, nonzero limit
as $x \to 0$. As we explained above, these two conditions are precisely what is needed to ensure
Arrhenius weak-noise asymptotics, with an MFPT prefactor proportional to $K(0,0)^{-1}$.

The bifurcation transition present in the $\mu = 1$ standard model at $\alpha = 4$ is reflected in
the behavior of $w_2$ and $k_0$ as $x \to 0$. When $\alpha = \alpha_c = 4$, equations (23) and (24) can be
solved exactly; one finds

$$w_2(x) = \partial^2 W/\partial y^2(x, 0) = 4x^2, \tag{38}$$
$$k_0(x) = K(x, 0) = 1/x. \tag{39}$$

Figure 5: A plot of $K(0,0)$, to which the weak-noise activation rate prefactor is proportional,
as a function of the off-axis softening parameter $\alpha$. As shown, $K(0,0)$ diverges as $\alpha \to \alpha_c^-$.
This is for the standard double well model (6), with $\mu = 1$, for which $\alpha_c = 4$.
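
The exact critical solutions (38) and (39) can be checked directly against the on-axis equations (23) and (24). The following sketch does so by finite differences, assuming the coefficient functions (35)-(37) with $\mu = 1$ and $\alpha = \alpha_c = 4$; the helper names and the step size are illustrative choices.

```python
ALPHA_C, MU = 4.0, 1.0  # critical softening parameter for mu = 1


def v0(x):
    return x - x**3                # equation (35)


def v2(x):
    return -ALPHA_C * x            # equation (36), divided by 2


def u1(x):
    return -MU * (1.0 + x * x)     # equation (37)


def w2(x):
    return 4.0 * x * x             # claimed solution (38)


def k0(x):
    return 1.0 / x                 # claimed solution (39)


def ddx(f, x, h=1e-5):
    """Central-difference derivative."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def residual_24(x):
    """Left side minus right side of (24), using d/dt = -v0 d/dx."""
    return -v0(x) * ddx(w2, x) - (-w2(x) ** 2 - 2.0 * u1(x) * w2(x) + 4.0 * v0(x) * v2(x))


def residual_23(x):
    """Left side minus right side of (23), using d/dt = -v0 d/dx."""
    return -v0(x) * ddx(k0, x) - (-(u1(x) + 0.5 * w2(x)) * k0(x))
```

Both residuals should vanish, up to finite-difference error, at any $0 < x < 1$. Both solutions also satisfy the initial conditions at $S$: $w_2(1) = 4 = 2|u_1(1)|$ and $k_0(1) = 1$.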

We know from the Eyring formula that the activation rate prefactor, in the limit of weak
noise, is proportional to $K(0,0)$. The fact that $k_0(x = 0) = K(0,0)$ is infinite here strongly
suggests that at criticality the activation rate, i.e., the rate at which the quasistationary
density is absorbed on the separatrix, is anomalously large. Equivalently, it suggests that
at criticality the weak-noise behavior of the MFPT (which is asymptotically equal to $\lambda_1^{-1}$) is
non-Arrhenius, with a pre-exponential factor that tends to zero as $\epsilon \to 0$. There is an even
stronger piece of evidence that this is the case. It is not difficult to show, by analysing the
transport equation (23), that $K(0,0) = k_0(x = 0) \sim (\alpha_c - \alpha)^{-1/2}$ as $\alpha \to \alpha_c^-$. Figure 5 shows
the result of a numerical computation when $\mu = 1$. The activation rate prefactor diverges
as $\alpha \to \alpha_c^-$. Equivalently, the MFPT prefactor tends to zero. The natural deduction is that
at criticality the activation rate prefactor, and its reciprocal the MFPT prefactor, become
$\epsilon$-dependent. This blends nicely with the behavior above the transition, since (as shown
in Fig. 3) the exponential growth rate of the MFPT (the action barrier $\Delta W$, i.e., $W(0,0)$)
begins to decrease as $\alpha$ increases beyond $\alpha_c$. An $\epsilon$-dependent activation rate prefactor at
criticality, containing a negative power of $\epsilon$, would unify the exit behavior both below and
above criticality.
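
A numerical computation of the kind summarized in Fig. 5 can be sketched as follows. This is a minimal illustration under stated assumptions, not the computation behind the figure: it integrates (23) and (24) in instanton transit time together with $\dot x = -v_0(x)$, uses the coefficient functions (35)-(37) with $\mu = 1$, starts a small distance `delta` from $S$ with $w_2 = 4\mu$ and $K = 1$ (the normalization $K(S) = 1$ is an assumption), and stops at a small cutoff `x_end` instead of at the saddle itself. The integrator, the step size and the cutoffs are illustrative choices, and the routine is meant only for $0 < \alpha < 4$, where $w_2$ stays finite.

```python
import math

MU = 1.0


def rhs(state, alpha):
    """dx/dt = -v0, equation (24) for w2, and equation (23) written for ln k0."""
    x, w2, _ = state
    v0 = x - x**3
    v2 = -alpha * x
    u1 = -MU * (1.0 + x * x)
    return (-v0,
            -w2 * w2 - 2.0 * u1 * w2 + 4.0 * v0 * v2,
            -(u1 + 0.5 * w2))


def rk4_step(f, state, dt, *args):
    """One classical fourth-order Runge-Kutta step for a tuple-valued state."""
    def shifted(k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))
    k1 = f(state, *args)
    k2 = f(shifted(k1, 0.5 * dt), *args)
    k3 = f(shifted(k2, 0.5 * dt), *args)
    k4 = f(shifted(k3, dt), *args)
    return tuple(s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def K_saddle(alpha, delta=1e-6, x_end=1e-3, dt=1e-3):
    """Approximate K(0, 0) by integrating from x = 1 - delta down to x = x_end."""
    state = (1.0 - delta, 4.0 * MU, 0.0)  # w2(x_s) = 2|u1(x_s)| = 4 mu, ln K(S) = 0
    while state[0] > x_end:
        state = rk4_step(rhs, state, dt, alpha)
    return math.exp(state[2])
```

At $\alpha = \mu = 1$ the assumed drift is a gradient and `K_saddle` should return a value close to unity. If the divergence exponent is $1/2$, reducing $\alpha_c - \alpha$ by a factor of four (for instance, comparing `K_saddle(3.99)` with `K_saddle(3.96)`) should roughly double the result.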

We shall show in Section ^| that in critical double well models (e.g., in the standard model
with $\alpha = \alpha_c$), the weak-noise activation rate $\lambda_1 = \lambda_1(\epsilon)$ indeed has asymptotics

$$\lambda_1(\epsilon) \sim \text{const}\times\epsilon^{-s}\exp[-\Delta W/\epsilon], \qquad \epsilon \to 0, \tag{40}$$

where $s > 0$ is the singularity index mentioned in Section |J The computation of the
singularity index is nontrivial. Since the MFPT $\langle\tau\rangle$ satisfies $\langle\tau\rangle \sim \lambda_1^{-1}$, the $\epsilon^{-s}$ prefactor
in $\lambda_1$ gives rise to an $\epsilon^{s}$ MFPT prefactor. In critical double well models, in the weak-noise
limit the growth of the MFPT is slower than pure exponential.

At criticality, the action $W$ as well as the prefactor $K$ displays unusual behavior at the
saddle. We shall see that the behavior of (38), i.e., that $w_2$ tends to zero quadratically
as $x \to 0$, is universal. Since the WKB tube has variance $\sim \epsilon/w_2(x)$, this implies that the
tube splays out as the saddle is approached. To leading order, it splays out to infinite width.
This is an indication that the transverse fluctuations around the MPEP, on the $O(\epsilon^{1/2})$
lengthscale, at criticality become very strong near the saddle. In fact, that $w_2(x = 0) = 0$
causes some difficulty in the interpretation, near the saddle, of the WKB approximation to $\rho_0$
and $\rho_1$. One might expect that even though $w_2(x = 0) = 0$, the quartic tube approximation

$$\rho_0(x,y) \sim k_0(x)\exp\left\{-\left[w_0(x) + w_2(x)\frac{y^2}{2!} + w_4(x)\frac{y^4}{4!}\right]\Big/\epsilon\right\} \tag{41}$$

would suffice for an understanding of the behavior of the WKB approximation near the
saddle. If $w_2(x = 0)$ were zero but $w_4(x = 0)$ were finite and nonzero, transverse fluctuations
around the saddle would be, by (41), of magnitude $O(\epsilon^{1/4})$ rather than $O(\epsilon^{1/2})$. However,
explicit solution of eq. (25), the transport equation for $w_4 = \partial^4 W/\partial y^4 = \partial^3 p_y/\partial y^3$, shows
that in the $\mu = 1$ standard model at $\alpha = \alpha_c = 4$,

$$w_4(x) \sim (4/5)\,x^{-4} + (16/5)\,x^{-2} + 8 + \cdots. \tag{42}$$

The fact that $w_4(x = 0)$ is infinite, coupled with the fact that $w_2(x = 0)$ is zero, suggests
that at criticality, the transverse fluctuations near the saddle have no natural scale. In any
event, at criticality the standard matched asymptotic approximations technique of the last
section breaks down. We shall need to construct an approximation to the quasistationary
density $\rho_1$ near the saddle which (i) is valid at criticality, and (ii) matches to the WKB
approximation (41), despite its singular character.

Critical exponents, as we define them, describe the weak-noise behavior of a parametrized
double well model with a singularity at some point $\mathbf{x}_0$, as $\mathbf{x} \to \mathbf{x}_0$ and as the parameters
of the model tend to the values for which the singularity appears at $\mathbf{x}_0$. In particular,
they characterize the behavior at and near the bifurcation transition, and at and near the
saddle point, of the functions $W$ and $K$ appearing in the WKB approximation to $\rho_0$ and $\rho_1$.
At criticality, the divergence rates of the WKB prefactor $K$ as $x \to 0$ and $y \to 0$ supply
two such exponents; the scaling form which we shall use to approximate $W$ near a nascent
cusp (which involves fractional powers) will supply others. There are also critical exponents
describing what happens as one moves off criticality. As $\alpha$ is increased above $\alpha_c$ (i.e., above 4,
in the $\mu = 1$ standard model), the MPEP bifurcates. There is a critical exponent describing
the separation rate of the two resulting MPEP's, as Fig. ^ makes clear. There is also a
critical exponent describing the divergence rate of $K(0,0)$ as $\alpha \to \alpha_c^-$, which as we have
already noted equals 1/2. The singularity index $s$ can be regarded as a critical exponent too,
though of a different kind; to compute it, one must go beyond the WKB approximation.

The 'nascent cusp,' as a singularity, is located in a space parametrized by $x$, $y$, and
the parameter(s) of the drift field $\mathbf{u}$. But if one restricts oneself to a single double well
model, the only parameters are $x$ and $y$. In this case there is a natural analogy between the
Lagrangian manifold $\mathcal{M}$ in phase space, formed by bicharacteristics emanating from $(S, \mathbf{0})$
and $(S', \mathbf{0})$, and a thermodynamic surface. The action $W$, as a function on the manifold,
corresponds to a thermodynamic potential, in fact a Gibbs free energy. The equation
$\mathbf{p} = \mathbf{p}(\mathbf{x}) = \partial W/\partial\mathbf{x}$ corresponds to a relation between conjugate state variables, such as pressure
and volume. The singularities (points of non-differentiability) of the physical action $W(\mathbf{x})$
therefore correspond to phase transitions. The order of such a phase transition, in the
traditional sense, is the lowest order of spatial derivative (of $W$) which fails to be continuous.
In Section § we shall compute the order of the nascent cusp.

5.4 The Bifurcation Transition and Classical Mechanics 

We now explain how the equations of Section 5.2 allow the bifurcation transition to be
interpreted in terms of classical mechanics, and how one can predict whether or not any
given double well model is at criticality. We begin by considering models in which the
MPEP has already bifurcated, and the instanton trajectories emanating from the stable
point $S$ focus along the axis, as in Figs. 2(c) and 2(d).

Empirically, focusing occurs in models with a sufficiently large 'off-axis softening'
parameter $\alpha$. In such models, the action $W$ in a mathematical sense becomes multivalued near
a portion of the axis. Points $(x, 0)$ beyond the first focus are reached by multiple off-axis
instanton trajectories emanating from the stable point $S$, and in general these trajectories
will have different actions. They will also have different momenta $\mathbf{p} = \nabla W$ at the time they
reach $(x, 0)$.

This multivaluedness has a geometric interpretation, in terms of the shape of the
two-dimensional Lagrangian manifold (in the four-dimensional phase space) formed by the
bicharacteristics emanating from the point $(\mathbf{x}, \mathbf{p}) = (S, \mathbf{0})$. As $x$ decreases from $x_s$ toward zero, the
map $y \mapsto p_y$ in the vicinity of $y = 0$ is at first single-valued; the value $p_y = 0$, and no other,
corresponds to $y = 0$. Beyond the first focus $(x_f^{(1)}, 0)$, i.e., when $x < x_f^{(1)}$, the map $y \mapsto p_y$
becomes three-valued. At the second focus $(x_f^{(2)}, 0)$ it becomes five-valued, etc. The generic
evolution is shown in Figs. 6(a) through 6(f). Up to the first focus $y = y(p_y)$ near $p_y = 0$ may
be modelled as a linear function; beyond the focus, as a cubic. Beyond the second focus the
global description becomes more complicated, as is clear from the whorl in Fig. 6(f). A cubic
approximation is still appropriate in the immediate vicinity of $(y, p_y) = (0, 0)$, however.

Since the locus of all points $(y, p_y)$ at constant $x$ is obtained by intersecting the
Lagrangian manifold with the hyperplane $x = \text{const}$, the manifold itself becomes increasingly
'whorled' with each passage through a focus. The formation of convolutions in Lagrangian
manifolds was first considered by Berry and Balazs (in a time-dependent context), and
the progression in Fig. 6 resembles the figures in their paper. Geometrically, the
linear-to-cubic transition at each successive focus corresponds to the creation of a fold [12]. One can
fit the shape of the manifold near the $l$'th focus, i.e., near $(x, y, p_y) = (x_f^{(l)}, 0, 0)$, by the
phenomenological formula

$$y = y(x, p_y) = -a^{(l)}\,p_y^3 - b^{(l)}\,(x - x_f^{(l)})\,p_y \tag{43}$$

where $a^{(l)}$ and $b^{(l)}$ are certain positive constants. So each successive focus resembles a
Ginzburg-Landau second-order phase transition ($x$ corresponding to temperature, the focus
location $x_f$ to a critical temperature, $-y$ to a magnetic field, and $p_y$ to a magnetization).
We shall say more about the 'equation of state' (43) (which we stress is not applicable near
the 'nascent cusp' appearing at the saddle point of critical models) in Sections 7 and |.

Figure 6: Cross-sections through the Lagrangian manifold $\mathcal{M}$, revealing the 'whorling' that
takes place as one passes through any on-axis focus. These sketches show the map $y \mapsto p_y$
at successively decreasing values of $x$, as one moves from $S = (x_s, 0)$, past two foci $(x_f^{(1)}, 0)$
and $(x_f^{(2)}, 0)$, toward the saddle point $(0, 0)$. Shown are the cases (a) $x$ near $x_s$,
(b) $x_s > x > x_f^{(1)}$ and $x$ near $x_f^{(1)}$, (c

For on-axis (i.e., $y = 0$, $p_y = 0$) trajectories, the derivative $w_2(x) = \partial p_y/\partial y(x, y = 0)$
satisfies the Riccati equation (24). So the appearance of a focus, and of multiple foci, can
be investigated analytically. It is clear from Figs. 6(c) and 6(e) that passage through a
focus is signalled by the tangent plane to the manifold (at $y = 0$, $p_y = 0$) 'turning vertical';
equivalently, by $\partial y/\partial p_y$ passing through zero, or its reciprocal $w_2$ (a negative magnetic
susceptibility, in this context) passing through $-\infty$. To study this, recall that the Riccati
equation

$$\dot w_2 = -w_2^2 - 2u_1 w_2 + 4v_0 v_2 \tag{44}$$

involves a derivative with respect to instanton transit time, and that the on-axis instanton
trajectory (directed anti-parallel to the drift toward $S$) satisfies $\dot x = -v_0(x)$. Solutions $w_2$
can be regarded either as a function of $t$, for $-\infty < t < \infty$, or of $x$, for $x_s > x > 0$. We see
from the form of the Riccati equation that $w_2$ can indeed be driven to $-\infty$ in finite time,
i.e., at some point $x = x_f > 0$ to the right of the saddle. In fact one sees, if $t_f$ is the time
when this occurs (the focus time), that as $t \to t_f^-$, i.e., $x \to x_f^+$,

$$w_2(x) = \partial^2 W/\partial y^2(x, 0) \sim -(t_f - t)^{-1} \sim \text{const}\times\left[-(x - x_f)^{-1}\right]. \tag{45}$$

Here the constant multiplier equals $v_0(x_f)$, the speed of the on-axis instanton
trajectory when it passes through the focus $(x_f, 0)$, since $x - x_f \approx v_0(x_f)\,(t_f - t)$ near the
focus. We note in passing that by the transport equation (23), this blowup will induce a
blowup of the on-axis WKB prefactor $k_0$. One finds

$$k_0(x) = K(x, 0) \sim \text{const}\times(t_f - t)^{-1/2} \sim \text{const}\times(x - x_f)^{-1/2}. \tag{46}$$

Equations (45)-(46) contrast markedly with eqs. (38)-(39), which apply to the $\mu = 1$
standard model at criticality (where, in a formal sense, $x_f = 0$, since there is a nascent focus
at the saddle). Equations (45)-(46) are not restricted to the standard model; they hold in
greater generality. But they apply only when a bona fide focus is present at some $x_f > 0$
(i.e., before the saddle is reached), and the MPEP has already bifurcated.

By examining the Riccati equation (44), we see that $w_2$ will be driven to $-\infty$, and a
focus will be present, only if the inhomogeneous term $4v_0 v_2$ on the right-hand side of (44) is
sufficiently negative. (This is because $u_1 < 0$, by assumption.) But $2v_2 = \partial^2 u_x/\partial y^2(x, y = 0)$,
which we are taking to be negative when $0 < x < x_s$, measures the extent to which the drift
toward $S$ softens as one moves off-axis. So our empirical observation is confirmed analytically:
a sufficiently strong off-axis softening will create a focus, and a bifurcation of the MPEP!

Figure 7: The transverse action derivative $w_2(x) = \partial^2 W/\partial y^2(x, 0)$ in the $\mu = 1$ variant of the
standard double well model (5), in the vicinity of the bifurcation transition. The three curves,
from top to bottom, obtain when $\alpha = 3.9$, $4.0$ (the critical value $\alpha_c$), and $4.1$. When $\alpha < \alpha_c$,
$w_2 \to 2|\lambda_y|$, i.e., 2, as $x \to 0^+$. When $\alpha > \alpha_c$, $w_2$ is driven negative as $x$ decreases, and
passes through $-\infty$. The focal point $x = x_f$ where this occurs, in the model with $\alpha = 4.1$,
is indicated by a dashed line.

It is best to think of $w_2 = \partial p_y/\partial y(y = 0)$ as a slope, as in Fig. 6. As such, it may
rotate repeatedly through the point at infinity as $t$ increases, i.e., as $x$ decreases. Each such
rotation results in increased whorling of the Lagrangian manifold, and also corresponds to
a passage through a focus. So by counting the number of singularities of the solution curve
$w_2 = w_2(t)$, one may determine the number of foci present in any given double well model.

The standard model @ will serve as an example. For the reasons discussed in Section 5.2,
$w_2(t = -\infty)$, i.e., $w_2(x = x_s)$, in the standard model always equals $2|u_1(x_s)| = 2|u_1(1)| = 4\mu$.
Suppose that $\mu = 1$. We noted in the last section that if $0 < \alpha < 4$, $w_2$ is well-behaved
and positive at all times $t$ between $-\infty$ and $\infty$ inclusive, i.e., at all $x$ satisfying $0 \le x \le 1$.
We also explained what happens at $\alpha = \alpha_c = 4$, when the nascent cusp appears at the
saddle and the MPEP begins to bifurcate. At criticality, $w_2 \to 0$ as $t \to \infty$, i.e., as $x \to 0^+$.
If $4 < \alpha < 9$, $w_2$ is driven negative (as $t$ increases toward $\infty$), and passes through $-\infty$ before
returning (through $+\infty$) to finite, positive values. The change in behavior is shown in Fig. 7.
When $\alpha$ is raised above $\alpha_c$, we say that the graph of $w_2$ acquires unit winding number, since
it winds once through the point at infinity. A second transition occurs at $\alpha = \alpha_c^{(2)} = 9$.
If $9 < \alpha < 16$, $w_2$ passes through $-\infty$ twice, and its graph has winding number equal to 2.
Except at the critical values $\alpha = \alpha_c^{(j)} = (j + 1)^2$, $w_2$ in this model converges to the generic
value $2|u_1(0)| = 2|\lambda_y| = 2$ as $t \to \infty$, i.e., as $x \to 0^+$.

Since each passage of $w_2$ through $-\infty$ gives rise to a focus, the sequence of near-axis
instanton flow fields in the $\mu = 1$ standard model, as $\alpha$ is increased, displays a progressively
larger number of foci. In fact the progression is precisely as displayed in Figs. 2(a)
through 2(d). It is worth noting that in models with one or more foci, the WKB tube
centered on the axis becomes ill-defined when $w_2$ goes negative, which takes place at a location
on the axis somewhat before the first focus is reached. (See Fig. 7.)

In Section || we shall determine exactly what happens at the bifurcation transition of
any singly parametrized symmetric double well model. But we can now pose the question:
What, physically, causes the above values for $\alpha$ to be critical? If in general the odd function
$2v_2(x) = \partial^2 u_x/\partial y^2\,(x, 0)$ is negative between $x = 0$ and $x = x_s$ and is proportional to a
parameter $\alpha$, is there a classical mechanical technique of predicting the values of $\alpha$ at which
the on-axis instanton trajectory will bifurcate? The answer to this question is 'yes.' Our
technique relies on a linear stability analysis of the on-axis instanton trajectory, and identifies
the critical values of $\alpha$ as the values for which a transverse soft mode is present in
the zero-energy Hamiltonian dynamics. This is reminiscent of Langer's analysis of metastability
in one-dimensional models [24], [36]. But because we shall consider transverse, rather
than longitudinal, fluctuations around the instanton trajectory, our stability analysis will be
considerably simplified.

Let $\mathbf{x} = \mathbf{x}^*(t) = (x^*(t), 0)$ be the on-axis instanton trajectory, where $x = x^*(t)$ is the
solution of $\dot x = -v_0(x)$. Near-axis instanton trajectories, i.e., near-axis zero-energy classical
trajectories emanating from $S$, may to leading order be written as

$$\mathbf{x} = \mathbf{x}(t) = (x(t), y(t)) \sim (x^*(t), 0) + \delta\,(0, Y(t)) \tag{47}$$

where $\delta \ll 1$, and where $Y = Y(t)$ is some model-dependent function satisfying
$Y(t = -\infty) = 0$. $Y(t)$, $-\infty < t < \infty$, is a normalized transverse deviation. Similarly, near-axis
trajectories have momenta

$$\mathbf{p} = \mathbf{p}(t) = (p_x(t), p_y(t)) \sim (p_x^*(t), 0) + \delta\,(0, P_y(t)) \tag{48}$$

for some unknown function $P_y = P_y(t)$ satisfying $P_y(t = -\infty) = 0$. Here $\mathbf{p} = \mathbf{p}^*(t) =
(p_x^*(t), 0)$ is the momentum of the on-axis instanton trajectory at instanton transit time $t$.
We noted before eq. ( p2|) that as a function of $x$, $p_x^*$ equals $-2v_0$. So $p_x^*(t)$ equals $-2v_0(x^*(t))$.
We necessarily have

$$w_2(t) = P_y(t)/Y(t), \tag{49}$$

on account of $w_2$ equalling $\partial p_y/\partial y\,(y = 0)$.
Substituting equations (47) and (48) into the Hamilton equations derived from the
Wentzell-Freidlin Hamiltonian $H$, and separating terms proportional to $\delta$, yields the pair
of equations

$$\dot Y = \frac{\partial^2 H}{\partial p_y\,\partial y}\,(\mathbf{x}^*(t), \mathbf{p}^*(t))\,Y + \frac{\partial^2 H}{\partial p_y^2}\,(\mathbf{x}^*(t), \mathbf{p}^*(t))\,P_y \tag{50}$$

$$\dot P_y = -\frac{\partial^2 H}{\partial y^2}\,(\mathbf{x}^*(t), \mathbf{p}^*(t))\,Y - \frac{\partial^2 H}{\partial p_y\,\partial y}\,(\mathbf{x}^*(t), \mathbf{p}^*(t))\,P_y \tag{51}$$

Due to the special form of the Hamiltonian (12), and the expansions (19), this pair becomes

$$\dot Y = u_1(x^*(t))\,Y + P_y \tag{52}$$

$$\dot P_y = 4 v_0(x^*(t))\,v_2(x^*(t))\,Y - u_1(x^*(t))\,P_y. \tag{53}$$

So $w_2 = w_2(t)$ can be represented as the quotient of two functions of instanton transit time,
which satisfy a pair of coupled linear differential equations. We note in passing that an
analogous representation is possible for solutions $Z = Z(t)$ of the matrix Riccati equation
(|TH). The existence of such quotient representations is well known in the theory of
Riccati equations, and has a geometric interpretation [37].
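The quotient representation can be checked in one line. The following sketch, which is not part of the original derivation, differentiates $w_2 = P_y/Y$ and substitutes the linear pair (52)-(53), with all coefficients evaluated at $x^*(t)$. The result is a scalar Riccati equation for $w_2$. One would expect it to coincide with the Riccati equation for $w_2$ used earlier in the paper, but that identification is an assumption here, since the earlier equation is not reproduced in this section.

```latex
\begin{align*}
\dot w_2
  &= \frac{\dot P_y}{Y} - \frac{P_y}{Y}\,\frac{\dot Y}{Y}
   = \bigl(4 v_0 v_2 - u_1 w_2\bigr) - w_2\,\bigl(u_1 + w_2\bigr) \\
  &= -w_2^2 - 2 u_1 w_2 + 4 v_0 v_2 .
\end{align*}
```

A zero of $Y$ at which $P_y \neq 0$ is a pole of $w_2$, so the linear pair remains regular exactly where the Riccati solution passes through the point at infinity.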
Equations (50)-(51), and (52)-(53), may be viewed as Hamilton's equations for the effective
(i.e., time-dependent) transverse Wentzell-Freidlin Hamiltonian

$$\begin{aligned}
H_{\rm eff}(Y, P_y, t) &= \frac12\,\frac{\partial^2 H}{\partial p_y^2}\,P_y^2 + \frac{\partial^2 H}{\partial p_y\,\partial y}\,Y P_y + \frac12\,\frac{\partial^2 H}{\partial y^2}\,Y^2 \\
&= P_y^2/2 + u_1(x^*(t))\,Y P_y - 2 v_0(x^*(t))\,v_2(x^*(t))\,Y^2 .
\end{aligned}$$

To save space we have suppressed the arguments $(\mathbf{x}^*(t), \mathbf{p}^*(t))$ of the partial derivatives. This
quadratic Hamiltonian governs the small transverse fluctuations about the on-axis instanton
trajectory. Its Legendre transform $\dot Y P_y - H_{\rm eff}$, namely

$$\begin{aligned}
L_{\rm eff}(Y, \dot Y, t) &= \frac12\,\frac{\partial^2 L}{\partial \dot y^2}\,\dot Y^2 + \frac{\partial^2 L}{\partial \dot y\,\partial y}\,\dot Y Y + \frac12\,\frac{\partial^2 L}{\partial y^2}\,Y^2 \\
&= \bigl|\dot Y - u_1(x^*(t))\,Y\bigr|^2/2 + 2 v_0(x^*(t))\,v_2(x^*(t))\,Y^2
\end{aligned}$$

is an effective transverse Onsager-Machlup Lagrangian. Here $L$ is the Onsager-Machlup Lagrangian
(13), and we have suppressed the arguments $(\mathbf{x}^*(t), \dot{\mathbf{x}}^*(t))$ of the partial derivatives.
The corresponding Euler-Lagrange equation for the normalized transverse deviation Y, i.e., 



$$\ddot Y + \left\{ -\frac{d}{dt}\bigl[u_1(x^*(t))\bigr] - u_1^2(x^*(t)) - 4 v_0(x^*(t))\,v_2(x^*(t)) \right\} Y = 0, \tag{54}$$

is called a (transverse) Jacobi equation [38]. It may be written as an equation for $Y = Y(x)$,
$0 < x < x_s$, by changing the independent variable from $t$ to $x$. One gets

$$J Y = \frac{d}{dx}\left[ v_0(x)\,\frac{dY}{dx} \right] + \left[ u_1'(x) - \frac{u_1^2(x)}{v_0(x)} - 4 v_2(x) \right] Y = 0 \tag{55}$$

together with the boundary condition $Y(x = x_s) = 0$. This Jacobi equation, which is in
Sturm-Liouville form, governs the behavior of the instanton trajectories near the on-axis
trajectory (the MPEP, if it has not bifurcated). So it is responsible for the various behaviors
shown in Fig. ^. It is clear from our derivation that the Jacobi operator $J$, considered as
a quadratic form, defines the transverse second variation of the Onsager-Machlup action 
functional about the on-axis trajectory. 

Foci are by definition the points $(x, 0)$ where the off-axis instanton trajectories converge
(to leading order). Equivalently, $(x, 0)$ is a focus only if $Y(x) = 0$. But since $w_2 = P_y/Y$,
this implies that (unless $P_y(x) = 0$ also) $w_2(x)$ is infinite. This is precisely the necessary
condition for a focus that we derived earlier. If $Y$ passes through zero more than once, then
$w_2$ will pass through the point at infinity more than once. This is the mechanism by which,
e.g., the two-focus flow field of Fig. 2(d) engenders the increasingly 'whorled' Lagrangian
manifold of Figs. ||(a) through ||(f).
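The zero-counting criterion lends itself to a small numerical experiment. The sketch below is an illustration added here, not the authors' procedure. It integrates the linear pair (52)-(53) along the axis in the variable $x$ (using $\dot x = -v_0(x)$) and counts sign changes of $Y$. The coefficient functions $v_0 = x - x^3$, $u_1 = -\mu(1 + x^2)$, $v_2 = -\alpha x$ are the ones implied for the standard model by the operator (57); the function name `count_foci`, the starting offset from $S$, the inner cutoff near the saddle, and the step count are all illustrative assumptions.

```python
def count_foci(alpha, mu=1.0, x_start=0.99, x_end=0.05, n_steps=20000):
    """Count sign changes of the transverse deviation Y(x) for x_end < x < x_start."""

    def rhs(x, Y, P):
        v0 = x - x**3                 # on-axis drift u_x(x, 0)
        u1 = -mu * (1.0 + x * x)      # d(u_y)/dy evaluated on the axis
        v2 = -alpha * x               # half of d^2(u_x)/dy^2 on the axis
        # Equations (52)-(53), divided by dx/dt = -v0 to use x as the variable.
        dY = -(u1 * Y + P) / v0
        dP = -(4.0 * v0 * v2 * Y - u1 * P) / v0
        return dY, dP

    h = (x_end - x_start) / n_steps   # negative step: march from S toward the saddle
    # Start on the branch w2 = P/Y = 4*mu, the value of w2 at x = x_s (assumed
    # accurate enough a small distance away from S).
    x, Y, P = x_start, 1.0, 4.0 * mu
    zeros = 0
    for _ in range(n_steps):
        # One classical fourth-order Runge-Kutta step.
        k1 = rhs(x, Y, P)
        k2 = rhs(x + h / 2, Y + h / 2 * k1[0], P + h / 2 * k1[1])
        k3 = rhs(x + h / 2, Y + h / 2 * k2[0], P + h / 2 * k2[1])
        k4 = rhs(x + h, Y + h * k3[0], P + h * k3[1])
        Y_new = Y + h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        P_new = P + h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        if Y * Y_new < 0.0:           # Y changed sign: a focus lies in this step
            zeros += 1
        x, Y, P = x + h, Y_new, P_new
    return zeros
```

With $\mu = 1$, values of $\alpha$ well inside the intervals $(0, 4)$, $(4, 9)$ and $(9, 16)$ should give 0, 1 and 2 sign changes respectively, in line with the winding-number picture. For $\alpha$ very close to a critical value, a zero sits very near the saddle, inside the cutoff `x_end`, and the count is not reliable there.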

We can now give a simple criterion for determining whether or not a given double well 
model is at criticality. Suppose that the most probable escape path (MPEP) extends along 
the axis from S to the saddle, so that the symmetry is as yet unbroken. We know by the 
discussion in Section (| that criticality is signalled by the appearance of a nascent cusp at the 
saddle. The nascent cusp itself is not a focus, as Fig. [|(b) makes clear. But if the off-axis 
softening is increased, the nascent cusp becomes a genuine cusp (i.e., focus); it moves inward 
along the axis from the saddle toward S. This picture is consistent with the interpretation 
of the near-axis instanton flow field in terms of the function $Y(x)$, $0 < x < x_s$, only if the
nascent cusp, like a conventional on-axis focus, is a zero of $Y$.

So the signal for criticality is $Y$ equalling zero at $x = 0$. We can rephrase this as follows.
Critical double well models are those models with unbroken symmetry for which the Jacobi
equation $JY = 0$ for the transverse deviation function $Y$, equipped with boundary condition
$Y(x = x_s) = 0$ and also with $Y(x = 0) = 0$, has a nontrivial (i.e., nonzero) solution. The
nonzero solution $Y = Y_1(x)$, $0 < x < x_s$, when it exists, can be interpreted as a transverse
soft mode of the zero-energy Hamiltonian dynamics. If the off-axis softening is increased,
the on-axis MPEP will bifurcate. Just beyond criticality, there will be two symmetrically
placed off-axis MPEP's from $S$ to the saddle. They will be of the form $(x^*(t), \pm\delta\,Y_1(x^*(t)))$,
for some small $\delta$. This 'motion in the direction of a soft mode' is a standard bifurcation
effect. At criticality, the transverse soft mode $Y_1$ describes the way in which the two MPEP's
separate.

Suppose that the double well model is parametrized by an off-axis softening parameter $\alpha$,
i.e., that $v_2 = \alpha\tilde v_2$ for some odd function $\tilde v_2$, and that $v_0$ and $u_1$ are independent of $\alpha$. Then
by rewriting the Jacobi equation, one sees that the model will be at a bifurcation point if
and only if the Sturm-Liouville equation

$$\tilde J Y = \frac{1}{4\tilde v_2(x)}\,\frac{d}{dx}\left[ v_0(x)\,\frac{dY}{dx} \right] + \frac{1}{4\tilde v_2(x)} \left[ u_1'(x) - \frac{u_1^2(x)}{v_0(x)} \right] Y = \alpha Y, \tag{56}$$

equipped with Dirichlet boundary conditions $Y(x = 0) = Y(x = x_s) = 0$, has a nonzero
solution. The Sturm-Liouville operator $\tilde J$ may be called a normalized Jacobi operator.
We see that the set of critical values of $\alpha$ is precisely the spectrum of the normalized
Jacobi operator! Only the first critical value (i.e., lowest eigenvalue) $\alpha_c = \alpha^{(1)}$ will yield
an actual bifurcation of the MPEP. To each higher critical value $\alpha^{(j)}$, $j = 2, 3, \ldots$, there
corresponds a transverse eigenmode $Y_j$. But after the first bifurcation, the on-axis instanton
trajectory is no longer the physical MPEP. The higher eigenmodes $Y_j$, $j = 2, 3, \ldots$, which
are oscillatory, govern the further bifurcations of the on-axis instanton trajectory rather than
the further bifurcations (if any) of the physical MPEP's, which have already moved off-axis.

The case of the standard model (|6]) is instructive. Substituting from (|35|)-(|3T|) one finds 
as normalized Jacobi operator 



$$\tilde J = -\frac{1}{4x}\,\frac{d}{dx}\,(x - x^3)\,\frac{d}{dx} + \left[ \frac{\mu}{2} + \frac{\mu^2 (1 + x^2)^2}{4x^2 (1 - x^2)} \right]. \tag{57}$$



It is easily verified that on the interval from $x = 0$ to $x = x_s = 1$, this operator (when equipped
with Dirichlet boundary conditions) has spectrum

$$\alpha^{(j)} = j^2 + (3\mu - 1)\,j + (2\mu^2 - \mu), \qquad j = 1, 2, 3, \ldots \tag{58}$$
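The spectrum formula (58) is easy to tabulate. The short sketch below is added for illustration; the function name `alpha_critical` is hypothetical. It evaluates (58) and compares the lowest eigenvalue with the closed form $2\mu(\mu + 1)$.

```python
def alpha_critical(j, mu):
    """Eigenvalue alpha^(j) of the normalized Jacobi operator, as given by eq. (58)."""
    return j * j + (3.0 * mu - 1.0) * j + (2.0 * mu * mu - mu)

# mu = 1: the first three critical values of the off-axis softening parameter.
first_three = [alpha_critical(j, 1.0) for j in (1, 2, 3)]

# Any mu: the lowest eigenvalue paired with 2*mu*(mu + 1) for comparison.
lowest = [(alpha_critical(1, mu), 2.0 * mu * (mu + 1.0)) for mu in (0.5, 1.0, 2.0)]
```

For $\mu = 1$ the list `first_three` reproduces the progression of squares, and each pair in `lowest` agrees, since setting $j = 1$ in (58) gives $1 + (3\mu - 1) + (2\mu^2 - \mu) = 2\mu(\mu + 1)$.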



So in the standard model, the bifurcation of the physical MPEP occurs at $\alpha_c = \alpha^{(1)} =
2\mu(\mu + 1)$. Also, the standard model with $\mu = 1$ has $\alpha^{(j)} = (j+1)^2$, so the on-axis
instanton trajectory bifurcates at $\alpha = 4, 9, 16, \ldots$ We have several times mentioned this
curious progression of squares. The eigenfunctions $Y_j$ corresponding to the eigenvalues $\alpha^{(j)}$,
i.e., the transverse soft modes appearing at $\alpha = \alpha^{(j)}$, turn out to be of the form

$$Y_j(x) = (x - x^3)^\mu\, q_j(x), \tag{59}$$

where $q_j$ is an even polynomial of degree $2j - 2$. Substituting into the transverse Hamilton
equation (52) yields the analogous transverse momentum deviations $(P_y)_j$. One gets

$$\begin{aligned}
(P_y)_j(x) &= -v_0(x)\,Y_j'(x) - u_1(x)\,Y_j(x) \\
&= (x - x^3)^\mu \left[ 4\mu x^2 q_j(x) - (x - x^3)\,q_j'(x) \right]
\end{aligned}$$

so that in the standard model at $\alpha = \alpha^{(j)}$,

$$w_2(x) = (P_y)_j(x)/Y_j(x) = 4\mu x^2 - (x - x^3)\,q_j'(x)/q_j(x). \tag{60}$$

If $j = 1$ then $q_j$ reduces to a constant, and the second term is absent. So at the physical
bifurcation point [i.e., at $\alpha = \alpha_c = 2\mu(\mu + 1)$], $w_2(x)$ equals $4\mu x^2$. Moreover, $Y_1(x) =
(x - x^3)^\mu$. This transverse soft mode is seen clearly in Figs. 2(c) and 2(d), which show
the behavior of the $\mu = 1$ standard model beyond the bifurcation point. In those figures
the off-axis MPEP's are roughly proportional to $\pm(x - x^3)$, i.e., to $\pm Y_1$. As $\alpha$ is increased
above $\alpha_c$, the MPEP's move in the direction of the transverse soft mode.

Recall that the profile of the WKB tube of probability density centered on the $x$-axis is
asymptotically Gaussian, and that at specified $x$ this Gaussian has variance $\sim \epsilon/w_2(x)$. But
in the standard model, at the first (and only physical) critical value $\alpha = \alpha_c = 2\mu(\mu+1)$, $w_2(x)$
equals $4\mu x^2$. That $w_2 \to 0$ as $x \to 0$ implies that at criticality, the WKB tube splays out as
the saddle is approached. We have already seen the $\mu = 1$ case of this in Section |5l|. The
splayout is what one would expect from our picture of the bifurcation of the MPEP, which
begins at the saddle, as a phase transition. It simply says that on the $O(\epsilon^{1/2})$ transverse
lengthscale, the Gaussian fluctuations about the MPEP grow without bound as the nascent
cusp is approached.

It is easy to see that this behavior is universal: it occurs in any critical double well model
with a bifurcating MPEP. If the (diagonal) linearization of the drift field $\mathbf{u}$ at the saddle
has eigenvalues $(\lambda_x, \lambda_y)$, and $\mu$ is defined as usual to equal $|\lambda_y|/\lambda_x$, then examination of the
Jacobi equation shows that the soft mode $Y_1$ has asymptotics $Y_1(x) \sim C x^\mu$, $x \to 0^+$, for
some nonzero constant $C$. We have mentioned this 'approach path' property elsewhere [28].
Also, examination of the Hamilton equation for $(P_y)_1$ shows that $(P_y)_1(x) \sim C' x^{\mu+2}$ for some
nonzero $C'$. So at criticality, the quotient $w_2(x)$ satisfies (for any $\mu$)

$$w_2(x) = \partial^2 W/\partial y^2\,(x, 0) \sim \text{const} \times x^2, \tag{61}$$

as $x \to 0^+$, and the tube splayout always occurs. Incidentally, it follows by integrating the
transport equation (23) that

$$k_0(x) = K(x, 0) \sim \text{const} \times x^{-\mu}, \tag{62}$$

as $x \to 0^+$. Equations (61)-(62) summarize the universal behavior of the WKB tube near
the saddle, in any critical double well model. They are the extension to arbitrary critical
models of eqs. fl3"gp-(|3"5|), which applied only to the critical variant ($\alpha = \alpha_c = 4$) of the $\mu = 1$
standard model.
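The cancellation behind the universal $x^2$ law can be made explicit. The sketch below is added for illustration and uses only the near-saddle linearization $v_0(x) \approx \lambda_x x$, $u_1(x) \approx -|\lambda_y|$ together with the transverse Hamilton equation in the form $P_y = \dot Y - u_1 Y = -v_0\,dY/dx - u_1 Y$. It assumes, as in the standard model, that $v_0$ is odd and $u_1$ is even in $x$. The statement about the prefactor further assumes that the transport equation takes the standard form $\dot K = -(\nabla\cdot\mathbf{u} + \tfrac12\nabla^2 W)K$ along instanton trajectories; that form is an assumption, since the transport equation is not reproduced in this section.

```latex
% Soft mode: the O(x^mu) terms of (P_y)_1 cancel because mu = |lambda_y| / lambda_x.
\begin{align*}
(P_y)_1(x) &= -v_0(x)\,Y_1'(x) - u_1(x)\,Y_1(x)
  \approx \bigl(-\mu\lambda_x + |\lambda_y|\bigr)\,C x^{\mu} = 0 ,
\end{align*}
% so (P_y)_1 is controlled by the next-order terms, which are O(x^{mu+2}),
% and the quotient w_2 = (P_y)_1 / Y_1 is O(x^2).
%
% Prefactor on the axis (assumed form of the transport equation):
\begin{align*}
\nabla\cdot\mathbf{u} + \tfrac12\nabla^2 W
  &= (v_0' + u_1) + \tfrac12\,(-2v_0' + w_2) = u_1 + \tfrac12 w_2 , \\
\dot k_0 &= -\bigl(u_1 + \tfrac12 w_2\bigr)\,k_0 \;\approx\; |\lambda_y|\,k_0
  \quad (w_2 \to 0), \qquad
  x \propto e^{-\lambda_x t} \;\Longrightarrow\; k_0 \propto x^{-\mu} .
\end{align*}
```

In the generic (noncritical) case $w_2 \to 2|\lambda_y|$, the same computation gives $\dot k_0 \to 0$, i.e., a prefactor that stays finite at the saddle.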



We stressed in Section 5.2 that a Kramers-type error function approximation to the quasistationary
density $\rho_1$ near the saddle is appropriate only if $w_2 \to 2|u_1(0)|$ as the saddle is
approached. At criticality, since $w_2 \to 0$ instead, in order to apply the method of matched
asymptotic approximations we shall need to construct a different boundary layer approximation.
This will give rise to the universal non-Arrhenius MFPT asymptotics for models at
criticality.



6 Maslov-WKB Asymptotics 

By building on the previous sections, we can analyse the weak-noise behavior of double-well
models with singularities. We have seen that singularities may appear in the WKB approximation
$K(\mathbf{x}) \exp[-W(\mathbf{x})/\epsilon]$ for the stationary density $\rho_0$ and quasistationary density $\rho_1$.
The possible singular behaviors are summed up in eqs. (45)-(46), which apply to models
in which the MPEP has already bifurcated, and eqs. (61)-(62), which apply to models which
are critical in the sense of bifurcations. Models in which the MPEP has already bifurcated
have the property that the instanton trajectories emerging from $S$ focus at a point $(x_f, 0)$
on the axis, with $x_f > 0$. The prefactor $K$ of the WKB approximation will diverge there.
In critical models, there is no actual on-axis focusing. But the prefactor will nonetheless
diverge at the saddle point $(0, 0)$.

There is a standard procedure for extending the WKB approximation to such singular
points, by 'glueing in' auxiliary, non-WKB approximations. It originated with the work of
Keller and Rubinow on short-wave asymptotics [20], and has been most extensively developed
by Maslov [32]. For a mathematically rigorous treatment, see Duistermaat [10]. See
also Eckmann and Seneor jy|, for a partly pedagogical one-dimensional treatment. The
procedure may be applied to the (formal) asymptotic solutions of any partial differential
equation of the form $H(\mathbf{x}, -\epsilon\nabla)\rho = 0$, where $H$ is a specified Hamiltonian. Here we discuss
its application to the Smoluchowski equation, in arbitrary dimensionality $n$.



We know from Section 5.1 that mathematically, the WKB approximation to $\rho_0$ and $\rho_1$
is determined by (i) a Lagrangian manifold $\mathcal{M}$ in the $2n$-dimensional phase space, formed
by the bicharacteristics emanating from $(S, \mathbf{0})$ and $(S', \mathbf{0})$, and (ii) functions $W$ and $K$
defined on this manifold, and computable by integration along the bicharacteristics. Of the
points $P^{(i)} = (\mathbf{x}, \mathbf{p}^{(i)})$ 'over' any point $\mathbf{x}$, only the one with least action is physical. The values
there of $W$ and $K$ are the values $W(\mathbf{x})$ and $K(\mathbf{x})$ appearing in the WKB approximation.

This geometric interpretation motivates the introduction of a new, 'diffraction integral'
way of formulating the WKB approximation. At any point $P = (\mathbf{x}, \mathbf{p})$ on the Lagrangian
manifold $\mathcal{M}$, we have

$$W(P) = \int \mathbf{p} \cdot d\mathbf{x}, \tag{63}$$

the line integral being taken along the bicharacteristic terminating at $P$. We can define a
Legendre transform $\tilde W$, satisfying $\tilde W = \mathbf{x} \cdot \mathbf{p} - W$, by

$$\tilde W(P) = \int \mathbf{x} \cdot d\mathbf{p}. \tag{64}$$



It is natural to think of $\tilde W$ as a function of momentum $\mathbf{p}$, by projecting 'sideways' onto
momentum space. Of course $\tilde W(\mathbf{p})$ is potentially multivalued, like $W(\mathbf{x})$. For $W$, it is the
least of the possible values that is physical; for $\tilde W$, it is the most. But if one ignores the
multivaluedness of $\tilde W(\mathbf{p})$, one can write

$$K(\mathbf{x}) \exp[-W(\mathbf{x})/\epsilon] \sim \epsilon^{-n/2} \int\!\cdots\!\int \tilde K(\mathbf{p}) \exp\left\{ \left[ -\mathbf{x}\cdot\mathbf{p} + \tilde W(\mathbf{p}) \right] / \epsilon \right\} dp_1 \cdots dp_n, \tag{65}$$

where $K(\mathbf{x})$ and $\tilde K(\mathbf{p})$ are related by

$$K(\mathbf{x}) \propto \tilde K(\mathbf{p}) \Big/ \sqrt{\det \frac{\partial^2 \tilde W}{\partial p_i\,\partial p_j}(\mathbf{p})} = \tilde K(\mathbf{p}) \sqrt{\det \frac{\partial^2 W}{\partial x_i\,\partial x_j}(\mathbf{x})}, \tag{66}$$

the correspondence between $\mathbf{p}$ and $\mathbf{x}$ being given by $\mathbf{p}(\mathbf{x}) = \partial W/\partial\mathbf{x}$, or $\mathbf{x}(\mathbf{p}) = \partial \tilde W/\partial\mathbf{p}$.
The asymptotic equality in (65), as $\epsilon \to 0$, is justified by the method of steepest descent.
(It may be necessary to cut off the integral at large momentum to ensure convergence.) The
method of steepest descent automatically picks out the point $P = (\mathbf{x}, \mathbf{p})$ 'over' $\mathbf{x}$ with the
least action $W(P)$. We shall call (65) a diffraction integral representation, since (if $\epsilon$ is pure
imaginary) it resembles the diffraction integrals used in physical optics [Q.

We have assumed that the Hessian matrix $\partial^2 W/\partial x_i \partial x_j = \partial p_i/\partial x_j$, whose inverse is the
matrix $\partial^2 \tilde W/\partial p_i \partial p_j = \partial x_i/\partial p_j$, is negative definite. Actually it is often possible to make
sense of the above formulae even when this is not the case, by analytic continuation. It is
also possible to avoid the problem of positive eigenvalues by taking the Legendre transform
with respect to a partial (incomplete) set of variables. In $n = 2$ dimensions, this means
with respect to a single variable only. For example, one could use the alternative integral
representation

$$K(x, y) \exp[-W(x, y)/\epsilon] \sim \epsilon^{-1/2} \int \tilde K^{(x)}(p_x, y) \exp\left\{ \left[ -x p_x + \tilde W^{(x)}(p_x, y) \right] / \epsilon \right\} dp_x, \tag{67}$$

where $\tilde W^{(x)} = x p_x - W$ is regarded as a function of $p_x$ and $y$, and $K$ and $\tilde K^{(x)}$ are related by

$$K(x, y) \propto \tilde K^{(x)}(p_x, y) \Big/ \sqrt{\frac{\partial^2 \tilde W^{(x)}}{\partial p_x^2}(p_x, y)} = \tilde K^{(x)}(p_x, y) \sqrt{\frac{\partial^2 W}{\partial x^2}(x, y)}. \tag{68}$$

Here the correspondence between $(x, y)$ and $(p_x, y)$ is given by $p_x(x, y) = \partial W/\partial x$, or equivalently
$x(p_x, y) = \partial \tilde W^{(x)}/\partial p_x$.

It is clear that the transformed prefactor $\tilde K$ (resp. $\tilde K^{(x)}$, etc.) in these integral representations,
like $W$, $K$, and $\tilde W$ (resp. $\tilde W^{(x)}$, etc.), can be thought of as a function on the
Lagrangian manifold $\mathcal{M}$. Also, the momentum integration can be viewed as an integration
over $\mathcal{M}$. So introducing integral representations of this sort is really a way of replacing
the position-space WKB approximation $K(\mathbf{x}) \exp[-W(\mathbf{x})/\epsilon]$ to $\rho(\mathbf{x})$ by a smeared-out equivalent
one, or ones, involving integration over the manifold. As derived, these 'momentum
space' approximations are accurate only to leading order as $\epsilon \to 0$, since subdominant terms
in $\epsilon$ arising from the method of steepest descent have been neglected. But such terms could
be incorporated, if desired, by adding $\epsilon$-dependent corrections to the transformed prefactor.

If the new formulations of the WKB approximation are equivalent to the old, why have
we introduced them? The reason is that the equivalence holds only at points $\mathbf{x}$ at which
$K$ is finite. At singularities of $K$, the new formulations provide a means of computing the
true $\epsilon \to 0$ asymptotics of $\rho$. Moreover, they reveal how at least some singularities of $K$ can
be explained as artifacts, arising from the way in which $K$ is computed from $\tilde K$. It follows
from (66) that if the determinant of the Hessian matrix $\partial^2 W/\partial x_i \partial x_j = \partial p_i/\partial x_j$ diverges at
some point $\mathbf{x}$, then $\mathbf{x}$ will be a singularity of $K$ whenever $\tilde K$ is nonzero at the corresponding
momentum $\mathbf{p} = \mathbf{p}(\mathbf{x})$. In other words, singularities of $K$ may be more apparent than
real: they can arise from points $(\mathbf{x}, \mathbf{p})$ on the manifold where $\tilde K$ does not actually diverge.
A similar effect can arise from the representation (67), or from any other diffraction integral
representation.

The matrix $\partial p_i/\partial x_j$ is a matrix of partial slopes, which specifies (to first order) the shape
of the manifold in the vicinity of the point $(\mathbf{x}, \mathbf{p}) = (\mathbf{x}, \mathbf{p}(\mathbf{x}))$. Its determinant becomes
infinite only when at least one of its elements is infinite. Such a blowup occurs only at
locations on the manifold where the ($n$-dimensional) tangent hyperplane to the manifold
'turns vertical,' i.e., points along a momentum direction in the $2n$-dimensional phase space.
This is precisely the behavior one sees at a fold, as in Figs. 2(c), 2(d), and ^. The folding
over of the Lagrangian manifold can create singularities of $K$. This is clearly the cause of
the singular behavior of $K$ at the on-axis foci occurring in models with a bifurcated MPEP,
though not of the singular behavior at the nascent cusp occurring at criticality (which cannot
be transformed away).

Whether or not a singularity of the prefactor $K$ occurring at some point $\mathbf{x} = \mathbf{x}^*$ is an
artifact of this sort, by employing an appropriate diffraction integral representation one may
compute the true weak-noise asymptotics of $\rho(\mathbf{x}^*)$. One usually finds leading-order behavior
of the form $\text{const} \times \epsilon^{-a} \exp[-W(\mathbf{x}^*)/\epsilon]$, where $a$ is by definition the singularity index of $\mathbf{x}^*$.
In fact the $\epsilon$-dependent prefactor $\epsilon^{-a}$ should appear at all points $\mathbf{x}$ within some $\epsilon$-dependent
distance of $\mathbf{x}^*$, which shrinks to zero as $\epsilon \to 0$. Within this local region an asymptotically
exact formula for $\rho$, derived from the integral representation, will be uniformly valid. This
asymptotic approximation (non-WKB, at least in the traditional sense) will match in the
far field to the WKB approximation $K(\mathbf{x}) \exp[-W(\mathbf{x})/\epsilon]$.

In Sections 7 and 8 we shall see how this 'glueing in' procedure works, both in models
with a bifurcated MPEP and in models at criticality. For the moment we note only that the
construction of a local approximation to $\rho$, near the singular point $\mathbf{x}^*$, depends crucially on
the determination of the behavior of $\tilde W$ and $\tilde K$ near the corresponding point $\mathbf{p}^*$ in momentum
space. The case when $\tilde K$ is well-behaved ('slowly varying') in a neighborhood of $\mathbf{p}^*$, and the
singularity at $\mathbf{x} = \mathbf{x}^*$ is an artifact, is the simplest. Suppose that $\tilde W = \tilde W(\mathbf{p})$ can be
expanded in a power series around $\mathbf{p} = \mathbf{p}^*$. The matrix $\partial^2 \tilde W/\partial p_i \partial p_j$ must have a zero
eigenvalue at $\mathbf{p} = \mathbf{p}^*$, since otherwise the determinant of its inverse $\partial^2 W/\partial x_i \partial x_j$ would not
tend to infinity as $\mathbf{x} \to \mathbf{x}^*$, the Lagrangian manifold would not turn vertical there, and the
singularity in $K$ would not appear. The term catastrophe is used to describe what happens
to the manifold at $\mathbf{x} = \mathbf{x}^*$. It is a standard result, due largely to Arnol'd ||, that if the
manifold is smooth near $(\mathbf{x}^*, \mathbf{p}^*)$, the catastrophic behavior at $\mathbf{x} = \mathbf{x}^*$ can be captured
by approximating $\tilde W = \tilde W(\mathbf{p})$ by one of a handful of polynomial functions. These are the
'structurally stable' elementary catastrophes.

A single example, illustrating the similarity to the Ginzburg-Landau theory of phase
transitions, will suffice. In $n$ dimensions, suppose that a singularity at $\mathbf{x} = \mathbf{x}^*$ arises as an
artifact in the above sense, and that $\mathbf{p}^* = \partial W/\partial\mathbf{x}\,(\mathbf{x}^*)$. In appropriate (linearly transformed)
coordinates, write

$$\mathbf{z} = (z_1, \ldots, z_n) = \mathbf{x} - \mathbf{x}^* \tag{69}$$
$$\mathbf{g} = (g_1, \ldots, g_n) = \mathbf{p} - \mathbf{p}^* \tag{70}$$

A particularly common sort of catastrophe (a 'cuspoid') would be described locally by a
single-variable Legendre transform of the form

$$\tilde W^{(z_n)}(z_1, \ldots, z_{n-1}, g_n) = -\frac{a_0}{n+2}\,g_n^{n+2} - \frac{a_1}{n}\,z_1 g_n^{n} - \cdots - \frac{a_{n-1}}{2}\,z_{n-1} g_n^{2} + R(z_1, \ldots, z_{n-1}), \tag{71}$$

where $a_0, \ldots, a_{n-1}$ are constants, and $R(z_1, \ldots, z_{n-1})$ is a quadratic polynomial. Since $z_n =
\partial \tilde W^{(z_n)}/\partial g_n$, this expression implies

$$z_n = z_n(z_1, \ldots, z_{n-1}, g_n) = -a_0 g_n^{n+1} - a_1 z_1 g_n^{n-1} - \cdots - a_{n-1} z_{n-1} g_n. \tag{72}$$

The presence of a catastrophe at $(z_1, \ldots, z_{n-1}, g_n) = (0, \ldots, 0, 0)$ is signalled by the fact that
$\partial^2 \tilde W^{(z_n)}/\partial g_n^2 = \partial z_n/\partial g_n$ equals zero there.

We have already seen the $n = 2$ version of eq. (72) in Section 5.4, as a phenomenological
description of the shape of the manifold $\mathcal{M}$ near an on-axis focus. Recall that we interpreted
eq. (43), which is the $n = 2$ version, in thermodynamic terms: as the equation of state of a
substance undergoing a Ginzburg-Landau second-order phase transition. (E.g., $z_1$ is $T - T_c$,
$z_2$ is a negative magnetic field, and $g_2$ is magnetization.) Equation (72) is in fact a normal
form for the shape of a Lagrangian manifold near a cuspoid singularity. When $n = 2$, the
cuspoid is a cusp. If $n = 1$, only the first term on the right-hand side of (72) is present, and
the cuspoid reduces to a quadratic fold.
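The folding described by the $n = 2$ normal form can be illustrated by counting how many momenta $g_2$ lie 'over' a given point $(z_1, z_2)$. The sketch below is an illustration only: the function name `sheet_count` and the default values $a_0 = a_1 = 1$ are assumptions. It rewrites $z_2 = -a_0 g_2^3 - a_1 z_1 g_2$ as a depressed cubic and classifies it by the sign of its discriminant.

```python
def sheet_count(z1, z2, a0=1.0, a1=1.0):
    """Number of distinct real momenta g2 over (z1, z2), from eq. (72) with n = 2.

    The equation of state z2 = -a0*g**3 - a1*z1*g is equivalent to the depressed
    cubic g**3 + p*g + q = 0 with p = a1*z1/a0 and q = z2/a0 (a0 > 0 assumed).
    """
    p = a1 * z1 / a0
    q = z2 / a0
    disc = -4.0 * p**3 - 27.0 * q**2   # discriminant of the depressed cubic
    if disc > 0.0:
        return 3    # inside the cusp: the manifold is folded over, three sheets
    if disc < 0.0:
        return 1    # outside the cusp: the momentum is single-valued
    # disc == 0: on the caustic (a repeated root), or at the cusp point itself
    return 1 if (p == 0.0 and q == 0.0) else 2
```

For example, `sheet_count(1.0, 0.1)` lies on the single-valued side ($z_1 > 0$), while `sheet_count(-1.0, 0.1)` lies inside the cusp. The boundary `disc == 0`, i.e. $27 z_2^2 = -4 a_1^3 z_1^3 / a_0$, is the semicubical caustic emanating from the cusp point.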

In general, to each possible polynomial expression (normal form) for the Legendre- 
transformed action, there corresponds a non-WKB approximation to p in a local region 
near x = x*, computed from the appropriate diffraction integral. These integrals serve to 
define the canonical diffraction functions first explored by Maslov. The canonical diffraction 
functions include the classical Airy and Pearcey functions, which arise from folds and cusps 
respectively ||. We shall study the cusp case further in the next section, as a warmup for 
the study of the nascent cusp appearing at criticality. The normal form for the action near 
a nascent cusp will turn out to be nonpolynomial, but the Maslov-WKB technique will still
apply. 

We close this section by noting that diffraction integral representations are also useful
for incorporating symmetry constraints and boundary conditions. As an example of this,
consider behavior near the saddle point of a double well model. We emphasized in Section 5.1
that if no bifurcation of the MPEP has occurred, the WKB tube of probability density
centered on the axis will be well behaved as the saddle is approached. In particular, $w_2(x) =
\partial^2 W/\partial y^2\,(x, 0)$ will tend to $2|u_1(0)|$ as $x \to 0^+$. Since

$$\frac{\partial^2 W}{\partial x^2}(0, 0) = \frac{\partial p_x}{\partial x}(0, 0) = -2 v_0'(0) \tag{73}$$

and $v_0(x) = u_x(x, 0)$ is assumed to be smooth, in the absence of bifurcations $W$ will to leading
order be locally quadratic at the saddle. If $\mathbf{u}(x, y) \sim (\lambda_x x, -|\lambda_y| y)$ is the linearization of the
drift at the saddle, we have $u_1(0) = -|\lambda_y|$ and $v_0'(0) = \lambda_x$. So, near $(x, y) = (0, 0)$,

$$W(x, y) \approx W(0, 0) - \lambda_x x^2 + |\lambda_y| y^2. \tag{74}$$

And

$$\tilde W(p_x, p_y) \approx -W(0, 0) - p_x^2/4\lambda_x + p_y^2/4|\lambda_y| \tag{75}$$

$$\tilde W^{(x)}(p_x, y) \approx -W(0, 0) - p_x^2/4\lambda_x - |\lambda_y| y^2 \tag{76}$$

will be the leading-order approximations to the Legendre-transformed actions.

Since the Hessian matrix $\partial^2 \tilde W/\partial p_i \partial p_j$ is not negative definite, an integral representation
of the type (65) is not appropriate. But a representation of the type (67) may be used.
In the absence of bifurcations $K$ and $\tilde K^{(x)}$ are well behaved near the saddle, so substituting
(76) into (67) yields

$$\rho(x, y) \sim \text{const} \times \left\{ \epsilon^{-1/2} \int \exp\left[ -(x p_x + p_x^2/4\lambda_x)/\epsilon \right] dp_x \right\} e^{-|\lambda_y| y^2/\epsilon}. \tag{77}$$

If $p_x$ here is integrated from $-\infty$ to $\infty$, this approximation will be even in $x$. It will therefore
serve as an approximation to the stationary density $\rho_0(x, y)$ near the saddle. The integral may
be evaluated explicitly, and the approximation reduces to the standard inverted Gaussian
approximation

$$\rho_0(x, y) \sim \text{const} \times e^{+\lambda_x x^2/\epsilon}\, e^{-|\lambda_y| y^2/\epsilon}. \tag{78}$$

But when approximating the quasistationary density $\rho_1(x, y)$ near the saddle, one needs an
approximate solution of the Smoluchowski equation that is odd rather than even. Such an
approximate solution is obtained by integrating $p_x$ from $0$ to $\infty$ rather than from $-\infty$ to $\infty$.
If this is in fact done, eq. (77) reduces to (0), the standard Kramers-type error function
approximation to the quasistationary density!
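The two reductions of the $p_x$ integral can be checked numerically. The following sketch is added for illustration; the function names, the cutoff, and the sample parameter values are assumptions. It compares a trapezoidal evaluation of the integral in (77) with the closed forms obtained by completing the square: an inverted Gaussian for the full range, and a complementary error function profile for the half range.

```python
import math

def saddle_integral(x, lam, eps, half_range=False, n=40000):
    """eps**-0.5 * integral of exp[-(x*p + p*p/(4*lam))/eps] dp, trapezoidal rule.

    half_range=False integrates over (effectively) the whole real line,
    half_range=True over p >= 0 only.
    """
    # Cutoff placed many standard deviations beyond the Gaussian peak at p = -2*lam*x.
    cutoff = 2.0 * lam * abs(x) + 12.0 * math.sqrt(2.0 * lam * eps)
    lo = 0.0 if half_range else -cutoff
    h = (cutoff - lo) / n

    def f(p):
        return math.exp(-(x * p + p * p / (4.0 * lam)) / eps)

    total = 0.5 * (f(lo) + f(cutoff)) + sum(f(lo + k * h) for k in range(1, n))
    return total * h / math.sqrt(eps)

def full_range_exact(x, lam, eps):
    # Completing the square gives the inverted Gaussian in x.
    return math.sqrt(4.0 * math.pi * lam) * math.exp(lam * x * x / eps)

def half_range_exact(x, lam, eps):
    # The half-range integral adds a complementary error function factor.
    return (math.sqrt(math.pi * lam) * math.exp(lam * x * x / eps)
            * math.erfc(x * math.sqrt(lam / eps)))
```

The half-range result is the full-range one multiplied by $\tfrac12\,\mathrm{erfc}\bigl(x\sqrt{\lambda_x/\epsilon}\bigr)$, which interpolates between 1 and 0 across a layer of width $O(\epsilon^{1/2})$ at the saddle.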

Although error function approximations originated (with Kramers) in an entirely different 
context, they fit naturally into the Maslov-WKB framework. We conclude that diffraction 
integral representations can be modified to incorporate the effects of symmetry constraints. 
In Section 8.3 we shall use a similar half-range integration in our integral representation for
the quasistationary density near a nascent cusp. 



7 Scaling Behavior Near a Cusp 

We can apply the Maslov-WKB method of the last section to symmetric double well models
in which the MPEP has bifurcated, and the instanton trajectories emerging from $S = (x_s, 0)$
focus at a point $(x_f, 0)$, with $0 < x_f < x_s$. As we shall see, behavior near the focal
point $(x_f, 0)$ is best described in the language of critical phenomena.

The Maslov-WKB method was first applied to focusing (cusp) singularities in two-dimensional
models by Dykman et al. [12]. Their analysis, which does not assume any sort of
symmetry, specializes in the case of symmetry about the $x$-axis to the following. Assume that
the Legendre-transformed action $\tilde W^{(y)} = y p_y - W$, regarded as a function of $x$ and $p_y$, may
be asymptotically approximated near $(x, p_y) = (x_f, 0)$ by the cuspoid (codimension $n = 2$)
normal form

$$\tilde W^{(y)}(x, p_y) \sim -\frac{a_0}{4}\,p_y^4 - \frac{a_1}{2}\,(x - x_f)\,p_y^2 - w_0(x). \tag{79}$$

Here $a_0$ and $a_1$ are positive constants, and $w_0(x)$ is simply $W(x, 0)$, i.e., $-\tilde W^{(y)}(x, 0)$. Since
$y(x, p_y) = \partial \tilde W^{(y)}/\partial p_y\,(x, p_y)$, this assumption is equivalent to

$$y(x, p_y) \sim -a_0 p_y^3 - a_1 (x - x_f)\,p_y \tag{80}$$

which is the phenomenological (Ginzburg-Landau) equation of state (43), discussed at length
in Section 5.4. $\tilde W^{(y)} = \tilde W^{(y)}(x, p_y)$ can be viewed as a Helmholtz free energy, just as $W =
W(x, y)$ can be viewed as a Gibbs free energy.

The cuspoid form for $\tilde W^{(y)}$ is certainly consistent with the folding of the Lagrangian
manifold $\mathcal{M}$, as seen (in projection) in Figs. 2(c) and 2(d). It is also consistent with the
quantitative asymptotics of Section 5.4. Since $p_y = 0$ corresponds to $y = y(x, p_y) = 0$,
(80) implies

$$w_2(x) = \frac{\partial p_y}{\partial y}(x, 0) = \left[ \frac{\partial y}{\partial p_y}(x, 0) \right]^{-1} \sim -\frac{1}{a_1 (x - x_f)}. \tag{81}$$

This is precisely the near-focus blowup behavior of eqs. (45)-(46), which we derived analytically
from the Riccati equation (44). By comparing (81) with (44), we see that the constant
$a_1$ must equal $1/v_0(x = x_f)$, the reciprocal speed of the on-axis instanton trajectory as it
passes through the focus. Since $x - x_f$ is analogous to $T - T_c$ and $w_2$ to a (negative) magnetic
susceptibility, the blowup of (81) is analogous to the critical exponent $\gamma$ of the focus,
in thermodynamic language, equalling unity.

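Dykman et al. use a one-dimensional diffraction integral representation, resembling (67)
but with $x$ and $y$ interchanged, to approximate the stationary probability density $\rho_0$ near $\mathbf{x} =
(x_f, 0)$. A crucial assumption is that the transformed prefactor $\tilde K^{(y)} = \tilde K^{(y)}(x, p_y)$, which
has no direct thermodynamic interpretation, is well behaved (locally constant, or 'slowly
varying') near $(x, p_y) = (x_f, 0)$. If this is the case, and it may be approximated by a
constant, one can construct the Maslov-WKB approximation

$$\begin{aligned}
K(x, y) \exp[-W(x, y)/\epsilon]
&\sim \epsilon^{-1/2} \int \tilde K^{(y)}(x, p_y) \exp\left\{ \left[ -y p_y + \tilde W^{(y)}(x, p_y) \right] / \epsilon \right\} dp_y \\
&\sim \epsilon^{-1/2}\, \tilde K^{(y)}(x_f, 0)\, e^{-w_0(x)/\epsilon} \int \exp\left\{ -\left[ \frac{a_0}{4}\,p_y^4 + \frac{a_1}{2}\,(x - x_f)\,p_y^2 + y p_y \right] / \epsilon \right\} dp_y .
\end{aligned} \tag{82}$$

In terms of 'stretched' variables $X = (x - x_f)/\epsilon^{1/2}$ and $Y = y/\epsilon^{3/4}$ this becomes

$$\epsilon^{-1/4}\, a_0^{-1/4}\, \tilde K^{(y)}(x_f, 0)\, e^{-w_0(x_f)/\epsilon}\, e^{-X w_0'(x_f)/\epsilon^{1/2}}\, e^{-(X^2/2)\,w_0''(x_f)}\; \mathcal{P}\bigl(a_1 a_0^{-1/2} X,\; a_0^{-1/4} Y\bigr), \tag{83}$$

where the primes denote derivatives with respect to $x$. Here the canonical diffraction function

$$\mathcal{P}(u, v) = \int_{-\infty}^{\infty} \exp\left[ -\left( \frac14 t^4 + \frac12 u t^2 + v t \right) \right] dt$$

is a modified (real) Pearcey function (cf. Paris).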



The expression (83) is an asymptotically ($\epsilon \to 0$) valid approximation to the stationary
density $\rho_0$ and quasistationary density $\rho_1$, on the $x - x_f = O(\epsilon^{1/2})$, $y = O(\epsilon^{3/4})$ lengthscale
near the cusp $(x_f, 0)$. It supplements the WKB approximation, which is singular there.
One sees that on this lengthscale, the pre-exponential factor in $\rho_0$ and $\rho_1$ is actually of
magnitude $O(\epsilon^{-1/4})$. The singularity index of the cusp equals 1/4, as in physical optics.
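Because its exponent is real, the modified Pearcey function converges rapidly and is easy to evaluate by direct quadrature. The sketch below is added for illustration: the function name `real_pearcey`, the truncation `t_max`, and the grid size are assumptions, and the sign convention is the convergent one, $\exp[-(t^4/4 + u t^2/2 + v t)]$. It is intended for moderate values of $u$ and $v$ only.

```python
import math

def real_pearcey(u, v, t_max=8.0, n=4000):
    """Trapezoidal estimate of the integral of exp[-(t^4/4 + u*t^2/2 + v*t)] over real t.

    The quartic term makes the integrand negligible beyond |t| = t_max for
    moderate u and v; large negative u would need a larger t_max and care
    with floating-point overflow.
    """
    h = 2.0 * t_max / n
    total = 0.0
    for k in range(n + 1):
        t = -t_max + k * h
        weight = 0.5 if k in (0, n) else 1.0     # trapezoidal end-point weights
        total += weight * math.exp(-(0.25 * t**4 + 0.5 * u * t * t + v * t))
    return total * h

# At the cusp point itself (X = Y = 0) the integral has a closed form,
# Gamma(1/4)/sqrt(2), which fixes the constant multiplying eps**(-1/4).
cusp_value = real_pearcey(0.0, 0.0)
```

Substituting $p_y = \epsilon^{1/4} s$ in the integral of (82) is what produces the overall $\epsilon^{-1/2}\,\epsilon^{1/4} = \epsilon^{-1/4}$ prefactor, i.e., the singularity index 1/4.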

The absence of an 'i' from the exponent gives rise to unusual asymptotic behavior of
the diffraction function. The familiar Pearcey fringes of physical optics are replaced by an
exponential slope, which becomes increasingly steep as $\epsilon \to 0$. Beyond the cusp (i.e., at $x <
x_f$, which is analogous to $T < T_c$), the WKB approximation $K(x, y) \exp[-W(x, y)/\epsilon]$ is again
valid, but $W$ is no longer differentiable through the $x$-axis [28]. This is reflected in the far-field
asymptotics of the Pearcey function $\mathcal{P}$. One can show that in the far field, i.e., as $X =
(x - x_f)/\epsilon^{1/2} \to -\infty$, the expression (83) matches to a WKB approximation displaying this
nondifferentiability. One can also show that the fold caustic emanating from $(x_f, 0)$, as in
Fig. 2(c), is nonphysical. It arises from subdominant saddle points of the Pearcey integral,
and does not contribute to the leading weak-noise asymptotics for $\rho_0$ and $\rho_1$. This is closely
related to the fact that "optimal paths [i.e., physical instanton trajectories] do not encounter
caustics," as Dykman et al. put it.

Now the preceding Maslov-WKB treatment is satisfactory so far as it goes. But it leaves unresolved the issue of the validity of the Ginzburg-Landau approximation. The quartic normal form (77) for $\tilde W^{(y)} = \tilde W^{(y)}(x,p_y)$, and the cubic equation of state (80) for its first derivative $y = y(x,p_y)$, model a second-order phase transition with mean field (i.e., classical) critical exponents. Equivalently, they model the critical behavior of a system which, though it has a phase transition, has a smooth thermodynamic surface. In the present context, assuming the local validity of the Ginzburg-Landau approximation amounts to assuming that the Lagrangian manifold $\mathcal{M}$ is smooth through the point $(x,p_y) = (x_f,0)$. Of course the surface turns vertical there, causing $\partial p_y/\partial y$ to diverge. The assumption is that the singularity can be transformed away by using $x$ and $p_y$, rather than $x$ and $y$, as independent variables.

This assumption requires proof. One could presumably justify it by analysing the smoothness (and blowup) properties of solutions of the Hamilton-Jacobi equation. But we shall give a different, more physical justification. First, we shall model the local behavior of $W$ and $\tilde W^{(y)}$ by a scaling law, as in the modern theory of critical phenomena. Our treatment will serve as a warmup for Section 8, where we shall analyse the much more complicated (nonclassical) singularity appearing in models where the MPEP is beginning to bifurcate.

To see that a scaling law is appropriate in models with a bifurcated MPEP, consider the behavior of the on-axis transverse derivatives $w_{2m} = \partial^{2m}W/\partial y^{2m}(x,0)$. We know, from the results culminating in (46), that $w_2$ diverges as $(x - x_f)^{-1}$. The Riccati equation satisfied by $w_2$ is only the first of a hierarchy of ordinary differential equations, describing the evolution of the functions $w_{2m}$ as one moves along the on-axis instanton trajectory from $S$ (where $x = x_s$, and $t = -\infty$) to the saddle (where $x = 0$, and $t = +\infty$). For example, $w_4$ satisfies the ODE (25). $w_2$ appears in each of the higher equations, and its blowup will induce a blowup of $w_4, w_6, \ldots$ It is not difficult to show that

$$w_{2m}(x) = \frac{\partial^{2m}W}{\partial y^{2m}}(x,0) \sim \text{const}\times(x - x_f)^{-(3m-2)}, \qquad x \to x_f^+, \tag{85}$$

for $2m = 2, 4, 6, \ldots$



These blowup rates motivate the scaling Ansatz

$$W(x,y) \sim W(x,0) + (x - x_f)^2\, h_\pm\!\left(\frac{y}{|x - x_f|^{3/2}}\right), \qquad x \to x_f^\pm, \tag{86}$$

for the behavior of $W$ near the cusp $(x_f,0)$. Here the exponents 2 and 3/2 are determined uniquely by the $m$-dependence of the blowup rates, and the functions $h_\pm(\cdot)$ of the scaling variable $z = y/|x - x_f|^{3/2}$ are not yet determined (though they must be even). This Ansatz is assumed to be accurate to $O((x - x_f)^2)$, when $y = O(|x - x_f|^{3/2})$. We could equally well posit

$$W(x, z|x - x_f|^{3/2}) \sim W(x_f,0) + (x - x_f)W'(x_f,0) + \frac{(x - x_f)^2}{2}W''(x_f,0) + (x - x_f)^2 h_\pm(z) \tag{87}$$

as $x \to x_f^\pm$, since we are assuming the accuracy of the scaling Ansatz only up to $O((x - x_f)^2)$. The first three terms in this asymptotic approximation are 'regular'; the scaling behavior appears only in the final, singular term.

The exponents 2 and 3/2 are typical of a mean field theory. One can show that the scaling functions $h_\pm$ are also those of a mean field theory. They may be computed by substituting the scaling Ansatz (87) into the Hamilton-Jacobi equation $H(\mathbf{x},\nabla W) = 0$. For this, one needs to rewrite the Hamilton-Jacobi equation in terms of the independent variables $x$ and $z$. Using the formula (12) for $H$, and the expansions (19), one finds

$$\begin{aligned}
H(\mathbf{x},\mathbf{p}) &= \frac{p_x^2}{2} + \frac{p_y^2}{2} + u_x(x,y)\,p_x + u_y(x,y)\,p_y \\
&= \frac{p_x^2}{2} + \frac{p_y^2}{2} + \left[v_0(x) + v_2(x)y^2 + \cdots\right]p_x + \left[u_1(x)y + u_3(x)y^3 + \cdots\right]p_y \\
&\sim \frac{p_x^2}{2} + \frac{p_y^2}{2} + \left[v_0(x_f) \pm v_0'(x_f)\,|x - x_f|\right]p_x
\end{aligned}$$

up to $O(|x - x_f|^1)$ accuracy, since $y = z|x - x_f|^{3/2}$. It follows from the scaling form (86) that up to $O(|x - x_f|^1)$ accuracy,

$$\begin{aligned}
p_x(x,y) = \frac{\partial W}{\partial x}(x,y) &\sim -2v_0(x) \pm \left[2h_\pm(z) - (3/2)zh_\pm'(z)\right]|x - x_f| \\
&\sim -2\left[v_0(x_f) \pm v_0'(x_f)\,|x - x_f|\right] \pm \left[2h_\pm(z) - (3/2)zh_\pm'(z)\right]|x - x_f|, \\
p_y(x,y) = \frac{\partial W}{\partial y}(x,y) &\sim |x - x_f|^{1/2}\,h_\pm'(z),
\end{aligned} \tag{90}$$

where we have used the fact, established earlier, that $W'(x,0) = w_0'(x) = -2v_0(x)$. Substituting (90) into the above expansion of $H$, and setting the coefficient of $|x - x_f|^1$ equal to zero, yields the ODE

$$(h_\pm')^2 = \pm v_0(x_f)\left[4h_\pm - 3zh_\pm'\right]. \tag{91}$$

It is easier to solve for $z$ as a function of $h_\pm'$, than for $h_\pm$ as a function of $z$. One finds

$$z(h_\pm') = -C\,(h_\pm')^3 \mp v_0(x_f)^{-1}\,h_\pm', \tag{92}$$

where $C$ is undetermined. But $z = y/|x - x_f|^{3/2}$, and by (90), $h_\pm' \sim p_y/|x - x_f|^{1/2}$. Rewriting $z$ in terms of $y$ and $|x - x_f|$, and $h_\pm'$ in terms of $p_y$ and $|x - x_f|$, yields

$$\begin{aligned}
y = y(x,p_y) &\sim -Cp_y^3 \mp v_0(x_f)^{-1}\,|x - x_f|\,p_y \\
&= -Cp_y^3 - v_0(x_f)^{-1}\,(x - x_f)\,p_y.
\end{aligned} \tag{93}$$

If one identifies the model-dependent constant $C$ with $a$, this is precisely eq. (80), the mean field (Ginzburg-Landau) equation of state! It is valid on both sides of the on-axis focus, i.e., both when $x - x_f > 0$ and when $x - x_f < 0$.
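The claim that the cubic relation between $z$ and $h_\pm'$ solves the scaling ODE $(h_\pm')^2 = \pm v_0(x_f)[4h_\pm - 3zh_\pm']$ can be spot-checked numerically. The following is a minimal sketch for the upper-sign branch ($x > x_f$) only. It parametrizes the solution by $q = h'$, recovers $h$ algebraically from the ODE, and tests that $dh/dz = q$ by central differences. The function names and the sample values of `v0` and `C` are hypothetical choices made for illustration, not values taken from any model in the text.

```python
def z_cusp(q, v0, C):
    """z as a function of q = h' on the x > x_f side: z = -C q^3 - q / v0."""
    return -C * q**3 - q / v0

def h_cusp(q, v0, C):
    """h recovered algebraically from (h')^2 = v0 * (4 h - 3 z h')."""
    return (q * q / v0 + 3.0 * z_cusp(q, v0, C) * q) / 4.0

def slope_defect_cusp(q, v0, C, dq=1e-5):
    """Return dh/dz - q; it vanishes if the parametric pair solves the ODE."""
    dh = h_cusp(q + dq, v0, C) - h_cusp(q - dq, v0, C)
    dz = z_cusp(q + dq, v0, C) - z_cusp(q - dq, v0, C)
    return dh / dz - q

# Illustrative parameter values (assumptions, chosen arbitrarily).
for q in (-1.5, -0.3, 0.2, 1.0):
    print(q, slope_defect_cusp(q, v0=0.7, C=1.3))
```

The defect is limited only by finite-difference and rounding error, which is consistent with the parametric form being an exact solution of the ODE.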

This derivation illustrates how one may go from the pattern of blowup rates of the transverse derivatives $w_{2m}(x) = \partial^{2m}W/\partial y^{2m}(x,0)$ as $(x_f,0)$ is approached, to a scaling form for $W$, to an equation of state. The singular behavior of the WKB prefactor $K$ can be analysed similarly (we only summarize the analysis). We know by (46) that $K(x,0)$ diverges as $(x - x_f)^{-1/2}$ when $x \to x_f^+$. A scaling form

$$K(x,y) \sim \text{const}\times|x - x_f|^{-1/2}\,q_\pm\!\left(\frac{y}{|x - x_f|^{3/2}}\right), \qquad x \to x_f^\pm, \tag{94}$$

modelled after the scaling form (86) for $W$, may be used to approximate $K$ away from the $x$-axis. This approximation should be accurate to $O(|x - x_f|^{-1/2})$ as $x \to x_f$, when $y = O(|x - x_f|^{3/2})$. By substituting the two scaling forms (86) and (94) into the transport equation (17) for $K$, and working to leading order near $(x_f,0)$, one can determine the scaling functions $q_\pm = q_\pm(z)$. It is easily verified that collecting the $O(|x - x_f|^{-3/2})$ terms in the transport equation yields the ODE

$$\left[2h_\pm' \pm 3v_0(x_f)z\right]q_\pm' + \left[h_\pm'' \pm v_0(x_f)\right]q_\pm = 0, \tag{95}$$

which $q_\pm = q_\pm(z)$ must satisfy. Using elementary calculus, and the fact that $h_\pm = h_\pm(z)$ satisfies the ODE (91), one can show that eq. (95) has solution

$$q_\pm(z) = \text{const}\times\sqrt{h_\pm''(z)}. \tag{96}$$

But since $z = y/|x - x_f|^{3/2}$, we know by (90) that

$$h_\pm''(z) \sim |x - x_f|\,\frac{\partial p_y}{\partial y}(x,y). \tag{97}$$

Substituting (96) and (97) into the scaling form (94) for $K$ reduces it to

$$K(x,y) = K(x, z|x - x_f|^{3/2}) \sim \text{const}\times\sqrt{\frac{\partial p_y}{\partial y}(x,y)}. \tag{98}$$

This asymptotic approximation is very simple, and has a profound consequence. We know that the transformed prefactor $\tilde K^{(y)}(x,p_y)$ can be obtained from $K(x,y)$ by dividing by a 'Van Vleck factor' (cf. (66) and (68)). We therefore have that

$$\tilde K^{(y)}(x,p_y) \propto K(x,y)\bigg/\sqrt{\frac{\partial^2 W}{\partial y^2}(x,y)} \tag{99}$$

$$\sim \text{const}, \tag{100}$$

since $\partial^2 W/\partial y^2 = \partial p_y/\partial y$. This constant asymptotic approximation is accurate to leading order as $x \to x_f$, when $y = O(|x - x_f|^{3/2})$.

We have just deduced that on the appropriate lengthscale near the focus, i.e., $x - x_f = o(1)$ and $y = O(|x - x_f|^{3/2})$, the transformed WKB prefactor does not diverge. $\tilde K^{(y)}$, unlike the prefactor $K$ itself, is asymptotically constant near the focus. This was the crucial assumption made by Dykman et al., and we see that like the Ginzburg-Landau normal form for the Legendre-transformed action, it is justified by our scaling theory of local behavior.

We conclude that at least in the case of a generic (cusp) singularity, by investigating the 
blowup rates of the transverse action derivatives as the singularity is approached, one can de- 
rive scaling relations for W and K, and ultimately construct a Maslov-WKB approximation 
to the stationary probability density near the singularity. This technique is not restricted to 
singularities of the classical Ginzburg-Landau type. 



8 Scaling Behavior Near a Nascent Cusp 

Finally, we can construct a scaling theory of weak-noise behavior near the 'nascent cusp' 
singularity appearing at the saddle point of any symmetric double well model, at the onset 
of bifurcation. The construction will closely parallel the construction of the last section. 
But several novel features will appear. We shall find that Legendre-transformed versions of 
the action are approximated, in the vicinity of a nascent cusp, by nonpolynomial normal 
forms. Equivalently, the nascent cusp singularity, unlike an on-axis focus, will prove to have 
nonclassical critical exponents. The exponents will depend continuously on the parameter 
$\mu = |\lambda_y|/\lambda_x$, which characterizes the linearized drift field at the saddle.

The universal presence at criticality of a nongeneric two-sided caustic (which, as shown in an earlier figure, extends sideways from the saddle point) will follow from the nonpolynomial normal forms for the Legendre-transformed actions. Indeed, one of the normal forms will supply a nonpolynomial unfolding of the nongeneric caustic. Moreover, the fact that the critical exponents of the nascent cusp are model-dependent and continuously varying will induce a continuously varying singularity index, and a continuously varying prefactor exponent in the non-Arrhenius weak-noise MFPT asymptotics. To see this, we shall have to go beyond the WKB approximation, by applying the Maslov-WKB method. In Section 8.1 we analyse the scaling properties of the action and the WKB prefactor, and in Section 8.2, we compare our scaling formulae with numerical data. In Section 8.3 we apply the Maslov-WKB method, and construct weak-noise approximations to the stationary and quasistationary probability densities near the saddle.

Figure 8: A sketch of the near-axis Region N, defined by $|y|/|x|^{2\mu} < \text{const}$. The choice of constant is immaterial, so long as it is positive. Most of our expansions for the action $W$ and its Legendre transforms, in double well models at criticality, are valid as $(x,y) \to (0,0)$ from within Region N. The region could equally well be defined by $|p_y|/|x|^{2\mu} < \text{const}$, $|y|/|p_x|^{2\mu} < \text{const}$, or $|p_y|/|p_x|^{2\mu} < \text{const}$. To leading order near $(0,0)$, the four definitions are equivalent.



8.1 Scaling in the WKB Approximation 

Our scaling treatment of the nascent cusp begins with an investigation of the blowup rates of the transverse action derivatives $w_{2m}(x) = \partial^{2m}W/\partial y^{2m}(x,0)$, as $x \to 0^+$. Up to now we have written down only the ODE's satisfied by $w_2$ (i.e., the Riccati equation) and $w_4$ (i.e., eq. (25)). The full hierarchy of ODE's may be derived by substituting the Taylor series $\sum_{m=0}^{\infty} w_{2m}(x)\,y^{2m}/(2m)!$ for $W(x,y)$ into the Hamilton-Jacobi equation $H(\mathbf{x},\nabla W) = 0$, and separating out the coefficients of each power of $y$. One finds

$$\begin{aligned}
\dot w_{2m} = -v_0 w_{2m}' = {} & -\sum_{k=1}^{m}\binom{2m}{2k-1}\left[w_{2k}/2 + \tilde u_{2k-1}\right]w_{2m-2k+2} \\
& -\sum_{j=1}^{m-1}\binom{2m}{2j}\left[w_{2j}'/2 + \tilde v_{2j}\right]w_{2m-2j}' + 2v_0\tilde v_{2m},
\end{aligned} \tag{101}$$

where $\tilde u_{2j+1} = (2j+1)!\,u_{2j+1}$ and $\tilde v_{2j} = (2j)!\,v_{2j}$, and $u_{2j+1}$ and $v_{2j}$ are the drift velocity derivatives defined in (19). As usual, the time derivative here is with respect to transit time of the on-axis instanton trajectory, which satisfies $\dot x = -v_0(x)$ as it moves from $S = (x_s,0)$ to the saddle. Since $v_0(x) = u_x(x,0)$, this trajectory $t \mapsto x^*(t)$ moves anti-parallel to the drift. And since $\mathbf{u}(x,y) \sim (\lambda_x x, -|\lambda_y| y)$ near $(0,0)$, $x^*(t)$ is approximated (as $t \to +\infty$) by $\text{const}\times e^{-\lambda_x t}$.

We showed in Section 5.4, by analysing the Jacobi equation satisfied by the transverse soft mode, that

$$w_2(x) \sim \text{const}\times x^2, \qquad x \to 0^+ \tag{102}$$

in any double well model at the onset of bifurcation (see the equations ending with (62), and Fig. 7). If the fact that $w_2 \to 0$ as $x \to 0^+$ (i.e., as $t \to +\infty$) is substituted into the general ODE (101), it is easy to show, by integrating forward in time toward $t = +\infty$, that

$$w_{2m}(x) \sim \text{const}\times x^{-(4m-4)\mu}, \qquad x \to 0^+ \tag{103}$$

for $2m = 4, 6, 8, \ldots$ This pattern of blowup rates, as the saddle is approached, motivates the scaling Ansatz (cf. (86))

$$W(x,y) \sim W(x,0) + |x|^{4\mu}\,h(y/|x|^{2\mu}), \qquad x \to 0. \tag{104}$$

Here $h(\cdot)$ is some (even) scaling function, as yet undetermined, and the exponents $4\mu$ and $2\mu$ are determined uniquely by the $m$-dependence of the blowup rates of (103). This Ansatz is assumed to be accurate to $O(|x|^{4\mu})$ as $x \to 0$, when $y = O(|x|^{2\mu})$. We could equally well posit a finite-length asymptotic expansion for $W(x, z|x|^{2\mu})$, namely

$$W(x, z|x|^{2\mu}) \sim \left(\sum_{k=0}^{\lfloor 4\mu\rfloor}\frac{\partial^k W}{\partial x^k}(0,0)\,\frac{x^k}{k!}\right) + |x|^{4\mu}\,h(z), \qquad x \to 0. \tag{105}$$

Here $z = y/|x|^{2\mu}$ is the scaling variable. Only even powers of $x$ appear in the summation, and by convention, here and below $\lfloor 4\mu\rfloor$ denotes the greatest even integer less than or equal to $4\mu$. The expansion (105) is assumed to be accurate to $O(|x|^{4\mu})$, at any fixed value of $z$. It can be thought of as an asymptotic development of $W(\mathbf{x})$, as $\mathbf{x} \to \mathbf{0}$ from within the near-axis region defined by the condition $|z| < \text{const}$. This condition defines a notch-shaped region, which we call Region N. (See Fig. 8.)
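The hierarchy of ODE's for the transverse derivatives $w_{2m}$ has a simple combinatorial structure, which can be made concrete in code. The sketch below evaluates the right-hand side of the hierarchy for a given $m$ from tabulated values of $w_{2k}$, $w_{2k}'$, and the rescaled drift coefficients. The function name `w_dot` and the dictionary-based calling convention are illustrative assumptions, and the numerical inputs in the usage lines are arbitrary.

```python
from math import comb

def w_dot(m, w, dw, u_t, v_t, v0):
    """Right-hand side of the ODE for w_{2m} in the hierarchy.

    w[k], dw[k]    : w_k(x) and its x-derivative, keyed by even k
    u_t[k], v_t[k] : rescaled drift coefficients (k! times u_k, v_k)
    v0             : on-axis drift v_0(x)
    """
    total = 0.0
    # Terms coming from p_y^2 / 2 + u_y p_y.
    for k in range(1, m + 1):
        total -= (comb(2 * m, 2 * k - 1)
                  * (w[2 * k] / 2 + u_t[2 * k - 1])
                  * w[2 * m - 2 * k + 2])
    # Terms coming from p_x^2 / 2 + u_x p_x (excluding the w_{2m}' terms).
    for j in range(1, m):
        total -= (comb(2 * m, 2 * j)
                  * (dw[2 * j] / 2 + v_t[2 * j])
                  * dw[2 * m - 2 * j])
    return total + 2 * v0 * v_t[2 * m]

# For m = 1 the hierarchy reduces to a Riccati equation for w_2:
#   dw_2/dt = -w_2^2 - 2 u_1 w_2 + 2 v_0 * (2! v_2).
print(w_dot(1, {2: 0.3}, {}, {1: -0.5}, {2: 0.8}, 0.4))
```

Writing the sums this way makes it evident that $w_2$ enters every higher equation, which is the mechanism by which its behavior near the saddle controls that of $w_4, w_6, \ldots$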

It follows by differentiating (104) twice with respect to $y$ that $w_2(x) \sim h''(0)$ as $x \to 0$. For consistency with the 'splayout' behavior $w_2(x) \sim \text{const}\times x^2$ of (102), we must have $h''(0) = 0$. Notice the slight discrepancy: the falloff rate of $w_2$ is not fully captured by the scaling Ansatz. Actually, this is unsurprising. A term proportional to $x^2y^2$ in $W$, such as would arise from the $O(x^2)$ falloff of $w_2$ as $x \to 0$, would (in terms of $x$ and $z = y/|x|^{2\mu}$) be proportional to $|x|^{4\mu+2}z^2$. It would therefore be negligible in comparison to the scaling term $|x|^{4\mu}h(z)$, as $x \to 0$. The scaling term captures the blowup as $x \to 0$ of $w_4, w_6, w_8, \ldots$, but capturing the precise falloff rate of $w_2$ would require a more refined analysis. We shall not attempt to include in our Ansatz the 'sub-scaling' terms that such an analysis would require.

The scaling function $h(\cdot)$ may be computed by the technique used in Section 7. By substituting the expression (105) into the Hamilton-Jacobi equation $H(\mathbf{x},\nabla W) = 0$, rewriting the Hamilton-Jacobi equation in terms of the independent variables $x$ and $z$, and setting the coefficient of $|x|^{4\mu}$ equal to zero, one obtains an ODE for $h = h(z)$. This ODE turns out to be (cf. the corresponding ODE of Section 7)

$$(h')^2 = 2|\lambda_y|\left[4h - zh'\right]. \tag{106}$$

As in Section 7, it is easier to solve for $z$ as a function of $h'$, than for $h$ as a function of $z$. One finds

$$z = z(h') = h'/2|\lambda_y| + c\,(h')^{1/3}, \tag{107}$$

where $c$ is undetermined. But $z = y/|x|^{2\mu}$ and $p_y = \partial W/\partial y \sim |x|^{2\mu}h'(z)$. Rewriting $z$ in terms of $x$ and $y$, and $h'$ in terms of $x$ and $p_y$, yields an asymptotically accurate equation of state (cf. the equation of state of Section 7)

$$y = y(x,p_y) \sim p_y/2|\lambda_y| + c_{y;x,p_y}\,|x|^{4\mu/3}\,p_y^{1/3}, \tag{108}$$

where $c_{y;x,p_y} = c$. In practice the model-dependent constant $c$ would be computed numerically, by fitting (108) to the flow field of instanton trajectories in the vicinity of the saddle. We must have $c > 0$, since the map $p_y \mapsto y$ is necessarily monotone increasing near the saddle. This is because 'whorling,' as seen in an earlier figure, occurs only in models with a bifurcated MPEP. Whorling is absent at criticality, i.e., at the onset of bifurcation.
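As with the cusp, the relation between $z$ and $h'$ at the nascent cusp can be spot-checked against the scaling ODE $(h')^2 = 2|\lambda_y|[4h - zh']$. The sketch below parametrizes by $q = h'$, recovers $h$ algebraically from the ODE, and verifies $dh/dz = q$ by central differences. The names `z_nascent`, `h_nascent`, `slope_defect_nascent` and the sample values of `lam_y` (standing for $|\lambda_y|$) and `c` are hypothetical, chosen only for illustration.

```python
import math

def cbrt(q):
    """Real cube root, valid for negative arguments too."""
    return math.copysign(abs(q) ** (1.0 / 3.0), q)

def z_nascent(q, lam_y, c):
    """z as a function of q = h': z = q / (2 |lambda_y|) + c q^(1/3)."""
    return q / (2.0 * lam_y) + c * cbrt(q)

def h_nascent(q, lam_y, c):
    """h recovered algebraically from (h')^2 = 2 |lambda_y| (4 h - z h')."""
    return (q * q / (2.0 * lam_y) + z_nascent(q, lam_y, c) * q) / 4.0

def slope_defect_nascent(q, lam_y, c, dq=1e-6):
    """Return dh/dz - q; it vanishes if the parametric pair solves the ODE."""
    dh = h_nascent(q + dq, lam_y, c) - h_nascent(q - dq, lam_y, c)
    dz = z_nascent(q + dq, lam_y, c) - z_nascent(q - dq, lam_y, c)
    return dh / dz - q

# Illustrative parameter values (assumptions, chosen arbitrarily).
for q in (-1.0, 0.05, 0.5, 2.0):
    print(q, slope_defect_nascent(q, lam_y=1.3, c=0.7))
```

Note that $dz/dq$ diverges as $q \to 0$ because of the cube-root term, so $h'' = dq/dz$ tends to zero there, in line with the requirement $h''(0) = 0$.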

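The equation of state (108) is certainly not of the classical Ginzburg-Landau form. By anti-differentiating it, we can obtain an equally unusual approximation to the Legendre-transformed action $\tilde W^{(y)} = yp_y - W$, where $p_y = \partial W/\partial y$. Since $y = \partial\tilde W^{(y)}/\partial p_y$, we necessarily have

$$\tilde W^{(y)}(x,p_y) \sim -W(x,0) + p_y^2/4|\lambda_y| + C_{x,p_y}\,|x|^{4\mu/3}\,p_y^{4/3} \tag{109}$$

$$\sim \left(-W(0,0) + \lambda_x x^2 + \cdots + \text{const}\times x^{\lfloor 4\mu\rfloor}\right) + p_y^2/4|\lambda_y| + C_{x,p_y}\,|x|^{4\mu/3}\,p_y^{4/3}, \tag{110}$$

where $C_{x,p_y} = 3c/4$. This asymptotic approximation should be accurate to $O(|x|^{4\mu})$, when $p_y = O(|x|^{2\mu})$ [i.e., when $y = O(|x|^{2\mu})$, or when $\mathbf{x} \to \mathbf{0}$ from within Region N].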

The formula (110) can be called a nonpolynomial normal form for the transformed action $\tilde W^{(y)}$ near the nascent cusp. Notice that as $\mathbf{x} \to \mathbf{0}$, the final, nonpolynomial term $C_{x,p_y}|x|^{4\mu/3}p_y^{4/3}$ is significant in a relative sense only within Region N. In the far field of the $p_y = O(|x|^{2\mu})$ lengthscale, as $x \to 0$ it is increasingly dominated by the $p_y^2$ term, and the normal form reduces to a polynomial. The $C_{x,p_y}|x|^{4\mu/3}p_y^{4/3}$ term plays a much more important role in the near field. One can think of (110) as providing an interpolation between the non-polynomial asymptotic development that is valid as $\mathbf{x} \to \mathbf{0}$ from within Region N, and the polynomial development that is valid as $\mathbf{x} \to \mathbf{0}$ from within its far field. The scaling behavior is visible only within Region N.

It is worth noting that despite its asymptotic validity, the nonpolynomial normal form (110) does not fully capture the $p_y \to 0$ behavior of $\tilde W^{(y)}(x,p_y)$ at fixed, nonzero $x$. If the nonanalytic $p_y^{4/3}$ falloff were exact, it would follow by differentiating twice with respect to $p_y$ that $\partial^2\tilde W^{(y)}/\partial p_y^2$, i.e., $\partial y/\partial p_y$, would diverge as $p_y \to 0$. This would imply that $w_2 = \partial p_y/\partial y\,(y = 0)$ would be identically zero at any nonzero $x$. But we know that $w_2(x) \sim \text{const}\times x^2$, $x \to 0$. The discrepancy is due to the fact that the nonzero $w_2$ near $x = 0$ arises from 'sub-scaling' behavior that we are not attempting to model. It is not difficult to see that at fixed nonzero $x$, the apparent nonanalyticity at $p_y = 0$ must be 'rounded' at a lengthscale $p_y = O(|x|^{2\mu+3})$, or equivalently at $y = O(|x|^{2\mu+1})$, to yield consistency with the $w_2(x) \sim \text{const}\times x^2$ asymptotics. However, on the $p_y = O(|x|^{2\mu})$ lengthscale the rounding becomes invisible as $x \to 0$.

With its continuously varying (in general, irrational) exponent $4\mu/3$, the normal form for $\tilde W^{(y)}$ in Region N looks quite different from the normal forms of catastrophe theory. Its most striking feature is the non-analyticity at $(x,p_y) = (0,0)$, which can be interpreted thermodynamically. Recall that $\tilde W^{(y)}$ (like $\tilde W^{(x)}$, $\tilde W$, and $W$) can be viewed as a thermodynamic potential on the thermodynamic surface (i.e., Lagrangian manifold) $\mathcal{M}$. In fact, through its derivatives it determines the shape of $\mathcal{M}$. So at $(x,p_y) = (0,0)$, or equivalently at $(x,y) = (0,0)$, the surface $\mathcal{M}$ will itself be non-analytic. However, as $\mu$ increases, $\tilde W^{(y)}$ becomes increasingly differentiable (with respect to $x$, at least) at $x = 0$. The order of the 'phase transition' appearing at the saddle at criticality is, therefore, an increasing function of $\mu$.

We can Legendre-transform the normal form for $\tilde W^{(y)}$ to obtain a normal form for the double Legendre transform $\tilde W = \mathbf{x}\cdot\mathbf{p} - W = xp_x + \tilde W^{(y)}$, as a function of $p_x$ and $p_y$. A further Legendre transform will yield a normal form for the remaining thermodynamic potential, $\tilde W^{(x)}$. We sketch only the first of these two computations. Differentiating (109)-(110) with respect to $x$, and using $p_x = \partial W/\partial x = -\partial\tilde W^{(y)}/\partial x$, yields

$$p_x = p_x(x,p_y) \sim p_x(x,0) + c_{p_x;x,p_y}\,|x|^{4\mu/3-1}\,p_y^{4/3}\,\mathrm{sgn}\,x \tag{111}$$

$$\sim \left(-2\lambda_x x + \cdots + \text{const}\times x^{\lfloor 4\mu\rfloor-1}\right) + c_{p_x;x,p_y}\,|x|^{4\mu/3-1}\,p_y^{4/3}\,\mathrm{sgn}\,x, \tag{112}$$

where $c_{p_x;x,p_y} = -(4\mu/3)C_{x,p_y} = -\mu c$. Here we have used the fact that $W'(x,0) = -2v_0(x)$, so that $W''(0,0) = -2\lambda_x$, etc. This approximation is accurate to $O(|x|^{4\mu-1})$ as $x \to 0$, when $p_y = O(|x|^{2\mu})$ [i.e., when $\mathbf{x} \to \mathbf{0}$ from within Region N]. If $\mu > 1/2$, it is easy to invert the series (112) to approximate $x = x(p_x,p_y)$. The $-2\lambda_x x$ term is dominant, and inversion yields

$$x = x(p_x,p_y) \sim x(p_x,0) + c_{x;p_x,p_y}\,|p_x|^{4\mu/3-1}\,p_y^{4/3}\,\mathrm{sgn}\,p_x \tag{113}$$

$$\sim \left(-p_x/2\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor-1}\right) + c_{x;p_x,p_y}\,|p_x|^{4\mu/3-1}\,p_y^{4/3}\,\mathrm{sgn}\,p_x, \tag{114}$$

where $c_{x;p_x,p_y} = -(2\lambda_x)^{-4\mu/3}\,c_{p_x;x,p_y} = (2\lambda_x)^{-4\mu/3}\mu c$. Since $x(p_x,p_y) = \partial\tilde W/\partial p_x\,(p_x,p_y)$, we must have

$$\tilde W(p_x,p_y) \sim \tilde W(p_x,0) + p_y^2/4|\lambda_y| + C_{p_x,p_y}\,|p_x|^{4\mu/3}\,p_y^{4/3} \tag{115}$$

$$\sim \left(-W(0,0) - p_x^2/4\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor}\right) + p_y^2/4|\lambda_y| + C_{p_x,p_y}\,|p_x|^{4\mu/3}\,p_y^{4/3} \tag{116}$$

$$\approx -W(0,0) - p_x^2/4\lambda_x + p_y^2/4|\lambda_y| + C_{p_x,p_y}\,|p_x|^{4\mu/3}\,p_y^{4/3}, \tag{117}$$

where $C_{p_x,p_y} = (3/4\mu)\,c_{x;p_x,p_y} = (3c/4)(2\lambda_x)^{-4\mu/3}$. The momentum-space normal forms (115) and (116) should be accurate to $O(|p_x|^{4\mu})$ as $p_x \to 0$, when $p_y = O(|p_x|^{2\mu})$. This is simply a momentum-space version of the condition that $\mathbf{x} \to \mathbf{0}$ from within Region N.

It is useful to compare the truncated normal form (117) with (75), the quadratic approximation to $\tilde W$ that is valid near the saddle point in the absence of focusing. We see that the fact that a double well model is 'critical' modifies the double Legendre transform $\tilde W$ near the saddle in a very simple way: it adds the final, nonpolynomial term. In a sense, the coefficient $C_{p_x,p_y}$ measures the strength of the nascent cusp singularity at the saddle.

$$\begin{aligned}
\tilde W(p_x,p_y) &\sim \tilde W(p_x,0) + p_y^2/4|\lambda_y| + C_{p_x,p_y}\,|p_x|^{4\mu/3}\,p_y^{4/3} \\
&\sim \left(-W(0,0) - p_x^2/4\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor}\right) + p_y^2/4|\lambda_y| + C_{p_x,p_y}\,|p_x|^{4\mu/3}\,p_y^{4/3} \\
\tilde W^{(x)}(p_x,y) &\sim \tilde W^{(x)}(p_x,0) - |\lambda_y|y^2 + C_{p_x,y}\,|p_x|^{4\mu/3}\,y^{4/3} \\
&\sim \left(-W(0,0) - p_x^2/4\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor}\right) - |\lambda_y|y^2 + C_{p_x,y}\,|p_x|^{4\mu/3}\,y^{4/3} \\
\tilde W^{(y)}(x,p_y) &\sim \tilde W^{(y)}(x,0) + p_y^2/4|\lambda_y| + C_{x,p_y}\,|x|^{4\mu/3}\,p_y^{4/3} \\
&\sim \left(-W(0,0) + \lambda_x x^2 + \cdots + \text{const}\times x^{\lfloor 4\mu\rfloor}\right) + p_y^2/4|\lambda_y| + C_{x,p_y}\,|x|^{4\mu/3}\,p_y^{4/3} \\
W(x,y) &\sim W(x,0) + |x|^{4\mu}\,h(y/|x|^{2\mu}) \\
&\sim \left(W(0,0) - \lambda_x x^2 + \cdots + \text{const}\times x^{\lfloor 4\mu\rfloor}\right) + |x|^{4\mu}\,h(y/|x|^{2\mu})
\end{aligned}$$

Figure 9: Normal forms for the thermodynamic potentials (the Legendre transforms of the action $W$, and $W$ itself) in the vicinity of a nascent cusp. In terms of the model-dependent constant $c$, $C_{p_x,p_y} = (3c/4)(2\lambda_x)^{-4\mu/3}$, $C_{p_x,y} = (3c/4)(2\lambda_x)^{-4\mu/3}(2|\lambda_y|)^{4/3}$, and $C_{x,p_y} = 3c/4$. These asymptotic expansions are valid in Region N, in critical double well models with $\mu > 1/2$. They are accurate to $O(|x|^{4\mu})$, or equivalently to $O(|p_x|^{4\mu})$.

The computation of the remaining thermodynamic potential, $\tilde W^{(x)}$, is left to the reader. In Figure 9 we list the normal forms for $\tilde W$, $\tilde W^{(x)}$, and $\tilde W^{(y)}$, as well as the scaling form for $W$. The expressions listed there are accurate to $O(|x|^{2\mu})$, i.e., to $O(|p_x|^{2\mu})$, as the nascent cusp is approached from within Region N. In Figure 10 we list the four possible equations of state for $x$ and $y$. They are accurate to $O(|x|^{2\mu-1})$, i.e., to $O(|p_x|^{2\mu-1})$, in the same limit.

We emphasize that the normal form (116) for $\tilde W$, the normal form for $\tilde W^{(x)}$, and the equations of state that follow from them, are valid only for critical models with $\mu > 1/2$. The reason is that when $\mu < 1/2$, the final term in (112), which when $p_y = O(|x|^{2\mu})$ is of magnitude $O(x^{4\mu-1})$, is at least as large as the $-2\lambda_x x$ term as $x \to 0$. In fact when $\mu < 1/2$, in Region N (except on the $x$-axis) the leading asymptotics of $p_x = p_x(x,p_y)$ are not linear in $x$. This makes difficult the computation of asymptotic approximations to $x = x(p_x,p_y)$ and $\tilde W = \tilde W(p_x,p_y)$. For this reason we shall assume $\mu > 1/2$ henceforth.

$$\begin{aligned}
x = x(p_x,y) &\sim x(p_x,0) + c_{x;p_x,y}\,|p_x|^{4\mu/3-1}\,y^{4/3}\,\mathrm{sgn}\,p_x \\
&\sim \left(-p_x/2\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor-1}\right) + c_{x;p_x,y}\,|p_x|^{4\mu/3-1}\,y^{4/3}\,\mathrm{sgn}\,p_x \\
x = x(p_x,p_y) &\sim x(p_x,0) + c_{x;p_x,p_y}\,|p_x|^{4\mu/3-1}\,p_y^{4/3}\,\mathrm{sgn}\,p_x \\
&\sim \left(-p_x/2\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor-1}\right) + c_{x;p_x,p_y}\,|p_x|^{4\mu/3-1}\,p_y^{4/3}\,\mathrm{sgn}\,p_x \\
y = y(x,p_y) &\sim p_y/2|\lambda_y| + c_{y;x,p_y}\,|x|^{4\mu/3}\,p_y^{1/3} \\
y = y(p_x,p_y) &\sim p_y/2|\lambda_y| + c_{y;p_x,p_y}\,|p_x|^{4\mu/3}\,p_y^{1/3}
\end{aligned}$$

Figure 10: The four asymptotic equations of state, which describe the shape of the Lagrangian manifold $\mathcal{M}$ in the vicinity of a nascent cusp. It follows by differentiating the normal forms listed in Fig. 9 that $c_{x;i,j} = (4\mu/3)C_{i,j}$ and $c_{y;i,j} = (4/3)C_{i,j}$. So $c_{x;p_x,p_y} = \mu c/(2\lambda_x)^{4\mu/3}$, $c_{x;p_x,y} = \mu c\,(2|\lambda_y|)^{4/3}/(2\lambda_x)^{4\mu/3}$, $c_{y;p_x,p_y} = c/(2\lambda_x)^{4\mu/3}$, and $c_{y;x,p_y} = c$. These asymptotic expansions are valid in Region N, in critical double well models with $\mu > 1/2$.

It is a reasonable conjecture that in critical models where the symmetrical approximation (117) to $\tilde W = \tilde W(p_x,p_y)$ is valid, it is valid not merely near the $p_x$-axis (i.e., in Region N), but uniformly as $\mathbf{p} \to \mathbf{0}$. One would like to substitute it into the Maslov-WKB diffraction integral (65), so as to obtain boundary layer approximations to the stationary and quasistationary probability densities near the saddle point $(0,0)$. The approximation to the quasistationary density would be a replacement for the usual Kramers-type error function approximation, (31). From it, one could derive an Eyring formula for the MFPT asymptotics, as in Section 5. Unfortunately there is a problem. If $C_{p_x,p_y} = 0$ and (117) becomes quadratic, the Hessian matrix $\partial^2\tilde W/\partial p_i\partial p_j$ is clearly not negative definite. As we noted earlier, this precludes the use of the two-dimensional diffraction integral (65). The situation does not improve much if $C_{p_x,p_y}$ is positive, so it is preferable to use an alternative integral representation. The asymptotic approximation to the Legendre transform $\tilde W^{(x)} = xp_x - W = -yp_y + \tilde W$, as a function of $p_x$ and $y$, is listed in the table in Figure 9. A truncated version of it would be

$$\tilde W^{(x)}(p_x,y) \approx -W(0,0) - p_x^2/4\lambda_x - |\lambda_y|y^2 + C_{p_x,y}\,|p_x|^{4\mu/3}\,y^{4/3}, \tag{118}$$

which is a nonpolynomial modification of the Gaussian approximation given earlier. This approximation is precisely what is needed in the one-dimensional diffraction integral (67), which is what we shall use instead of (65).
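The passage from the momentum-space normal form to the mixed form in $(p_x, y)$ is a Legendre transformation in $p_y$, and the coefficient relation $C_{p_x,y} = C_{p_x,p_y}(2|\lambda_y|)^{4/3}$ holds to first order in the nonpolynomial term. The following sketch checks this numerically by solving $y = \partial\tilde W/\partial p_y$ for $p_y$ by bisection and comparing the resulting nonpolynomial remainder with the predicted one. All names and parameter values are illustrative assumptions; `C` stands for $C_{p_x,p_y}$ and `lam_y` for $|\lambda_y|$, and terms depending on $p_x$ alone are omitted since they cancel.

```python
def solve_py(y, eps_p, lam_y, iters=200):
    """Solve y = p/(2 lam_y) + (4/3) eps_p p^(1/3) for p >= 0 by bisection."""
    lo, hi = 0.0, 2.0 * lam_y * y
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if mid / (2.0 * lam_y) + (4.0 / 3.0) * eps_p * mid ** (1.0 / 3.0) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def nonpoly_part_exact(px, y, mu, lam_y, C):
    """Exact Legendre transform in p_y, minus the Gaussian part -lam_y y^2."""
    eps_p = C * abs(px) ** (4.0 * mu / 3.0)
    py = solve_py(y, eps_p, lam_y)
    transformed = py * py / (4.0 * lam_y) + eps_p * py ** (4.0 / 3.0) - y * py
    return transformed + lam_y * y * y

def nonpoly_part_predicted(px, y, mu, lam_y, C):
    """First-order prediction: C (2 lam_y)^(4/3) |p_x|^(4 mu/3) y^(4/3)."""
    return (C * (2.0 * lam_y) ** (4.0 / 3.0)
            * abs(px) ** (4.0 * mu / 3.0) * y ** (4.0 / 3.0))

for px in (1e-2, 1e-3):
    exact = nonpoly_part_exact(px, 1.0, 1.0, 0.8, 0.5)
    predicted = nonpoly_part_predicted(px, 1.0, 1.0, 0.8, 0.5)
    print(px, exact / predicted)
```

The ratio approaches unity as $p_x \to 0$ at fixed $y > 0$, with a discrepancy that is second order in the nonpolynomial coefficient.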

The reader may wonder about the domain of validity of the approximation (118) to $\tilde W^{(x)} = \tilde W^{(x)}(p_x,y)$. Is it valid outside Region N? In Section 8.2 we present numerical evidence that it is, in fact, a useful asymptotic approximation near the $y$-axis, even at fixed, nonzero $y$. Indeed, it explains the mysterious 'sideways' caustic noted earlier! To see this, differentiate (118) with respect to $p_x$ to get

$$x = x(p_x,y) \approx -p_x/2\lambda_x + c_{x;p_x,y}\,y^{4/3}\,|p_x|^{(4\mu/3)-1}\,\mathrm{sgn}\,p_x, \tag{119}$$

which is a truncated version of the asymptotic expansion of $x = x(p_x,y)$ listed in Figure 10. If $\mu < 3/2$, the formula (119) predicts that at any nonzero $y$, the map $p_x \mapsto x$ will not be monotone. This is because the coefficient $c_{x;p_x,y}$ is positive. By examination, if

$$|x| < \text{const}\times|y|^{(3/2-\mu)^{-1}}, \qquad y \to 0, \tag{120}$$

then the inverse map $p_x = p_x(x,y)$ (and hence $W = W(x,y)$) will be multivalued. This inequality defines a two-sided nongeneric caustic, which emanates from $(0,0)$ along the positive and negative $y$-axes.
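The scaling of the caustic's half-width with $|y|$ can be checked directly from the truncated equation of state $x \approx -p_x/2\lambda_x + c_{x;p_x,y}\,y^{4/3}|p_x|^{(4\mu/3)-1}\,\mathrm{sgn}\,p_x$. The sketch below locates the turning point of the map $p_x \mapsto x$ on the branch $p_x > 0$ by ternary search, and estimates the width exponent from two values of $y$. The function names, the unit values of `lam_x` and `c`, and the sample values of $y$ are assumptions made for illustration; the sketch presumes $3/4 < \mu < 3/2$.

```python
import math

def x_of_px(px, y, mu, lam_x, c):
    """Truncated equation of state x(p_x, y); c stands for c_{x;p_x,y}."""
    a = 4.0 * mu / 3.0 - 1.0
    nonpoly = c * abs(y) ** (4.0 / 3.0) * math.copysign(abs(px) ** a, px)
    return -px / (2.0 * lam_x) + nonpoly

def caustic_half_width(y, mu, lam_x, c):
    """Largest x reached on the p_x > 0 branch, found by ternary search."""
    hi = 1e-12
    while x_of_px(hi, y, mu, lam_x, c) > 0.0:
        hi *= 2.0  # bracket the maximum: x eventually turns negative
    lo = 0.0
    for _ in range(200):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if x_of_px(m1, y, mu, lam_x, c) < x_of_px(m2, y, mu, lam_x, c):
            lo = m1
        else:
            hi = m2
    return x_of_px(0.5 * (lo + hi), y, mu, lam_x, c)

def width_exponent(mu, lam_x=1.0, c=1.0, y1=1e-2, y2=1e-3):
    """Estimate d log(width) / d log|y| from two values of y."""
    w1 = caustic_half_width(y1, mu, lam_x, c)
    w2 = caustic_half_width(y2, mu, lam_x, c)
    return math.log(w1 / w2) / math.log(y1 / y2)

for mu in (1.0, 1.2):
    print(mu, width_exponent(mu), 1.0 / (1.5 - mu))
```

Since the truncated equation of state is a sum of two pure power laws, the estimated exponent should reproduce $(3/2 - \mu)^{-1}$ to within rounding error.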

In the language of catastrophe theory, the formula (119) is a (non-smooth) unfolding of the nongeneric caustic emanating from the nascent cusp. It also resembles a thermodynamic equation of state in the vicinity of a phase transition. However, the thermodynamic interpretation of the variables differs from the case of an on-axis focus, as analysed in the last section. Here $|y|$ is analogous to $T_c - T$, for example. And by examination, the thermodynamic critical exponent $\gamma$ is nonzero whenever $\mu < 3/2$. In any event, the critical exponents of the nascent cusp are clearly nonclassical: they depend continuously on the parameter $\mu$.

We caution the reader that in arbitrary double well models at criticality, the nonpolynomial approximation (118) and the nonclassical equation of state (119) may not necessarily describe the $p_x \to 0$ behavior of $x = x(p_x,y)$ at fixed, nonzero $y$. If $\mu < 3/4$, the $y^{4/3}|p_x|^{(4\mu/3)-1}\,\mathrm{sgn}\,p_x$ term in (119) would cause $x = x(p_x,y)$ to diverge as $p_x \to 0$, at any nonzero $y$ or $p_y$. Such a divergence would greatly distort the shape of the Lagrangian manifold $\mathcal{M}$. So we shall assume $\mu > 3/4$ henceforth. In Section 8.2 we present numerical evidence of the need for the $\mu > 3/4$ restriction, and also verify that the nongeneric caustic is present if and only if $\mu < 3/2$. Incidentally, our numerical results indicate that at fixed nonzero $y$, the apparent non-analyticity at $p_x = 0$ is 'rounded' at a sufficiently small ($|y|$-dependent) lengthscale, as $p_x \to 0$. This is analogous to the abovementioned rounding of $\tilde W^{(y)}(x,p_y)$, and its derivative $y = y(x,p_y)$, as $p_y \to 0$ at fixed nonzero $x$.

There is also a problem with the nonpolynomial approximation (118) and the nonclassical equation of state (119) when $\mu > 3$. To see this, note that a more complete asymptotic expansion of $x = x(p_x,y)$ near $p_x = 0$ would presumably be of the form

$$x = x(p_x,y) \approx \left(-p_x/2\lambda_x + \cdots + \text{const}\times p_x^{\lfloor 4\mu\rfloor-1}\right) + c_{x;p_x,y}\,y^{4/3}\,|p_x|^{(4\mu/3)-1}\,\mathrm{sgn}\,p_x. \tag{121}$$

Such an asymptotic expansion is listed in Fig. 10, and is certainly valid as $\mathbf{x} \to \mathbf{0}$ from within Region N. If a similar expansion is valid near the $y$-axis, we see that there will be a crossover at $\mu = 3$ between two regimes. When $\mu = 3$, the nonpolynomial $c_{x;p_x,y}\,y^{4/3}|p_x|^{(4\mu/3)-1}\,\mathrm{sgn}\,p_x$ term in (121) becomes $c_{x;p_x,y}\,y^{4/3}p_x^3$. This is increasingly dominated by the $p_x^3$ term in (121) as $y \to 0$. In fact when $\mu$ is raised above 3, at small $|y|$ the leading corrections to the naive $x \sim -p_x/2\lambda_x$ behavior are no longer given by the nonpolynomial term, but rather by the $p_x^3$ term. For this reason we shall assume for the remainder of our analysis that $\mu < 3$ as well as $\mu > 3/4$.

To use the one-dimensional Maslov-WKB diffraction integral (67) as promised, we need to approximate in the vicinity of the nascent cusp at $(p_x,y) = (0,0)$ not only the Legendre-transformed action $\tilde W^{(x)}(p_x,y)$, but also the transformed prefactor $\tilde K^{(x)}(p_x,y)$. It may be approximated in a very similar way, which we only summarize. By (61)-(62), $K(x,0)$ diverges in any critical model as $|x|^{-\mu}$, when $x \to 0$. A scaling form

$$K(x,y) \sim |x|^{-\mu}\,q(y/|x|^{2\mu}), \tag{122}$$

modelled after (94) and (104), may be used to approximate $K$ away from the $x$-axis. The approximation (122) should be accurate to $O(|x|^{-\mu})$ as $x \to 0$, when $y = O(|x|^{2\mu})$ [i.e., as $\mathbf{x} \to \mathbf{0}$ from within Region N]. By substituting (122) and the scaling form (104) for $W$ into the amplitude transport equation (17), and working to leading order near $(x,y) = (0,0)$, one can determine the scaling function $q = q(z)$. The procedure closely resembles the procedure used in Section 7. It is easily verified that collecting the $O(|x|^{-\mu})$ terms in the transport equation yields the ODE

$$2\left[h' + |\lambda_y|z\right]q' + h''q = 0, \tag{123}$$

which $q = q(z)$ must satisfy. (Cf. (95).) Here $h = h(z)$ is the scaling function for $W$, which satisfies the ODE (106). Using elementary calculus, one can show that eq. (123) has solution

$$q(z) = \text{const}\times|h'(z)|^{-1/3}\sqrt{h''(z)}. \tag{124}$$

(Cf. (96).) But since $p_y = \partial W/\partial y \sim |x|^{2\mu}h'$, one may write $|x|^{-2\mu}p_y$ for $h'$, and $\partial p_y/\partial y$ for $h''$. Substituting (124) into the scaling form (122), and performing the indicated rewriting, yields

$$K(x,y) \sim \text{const}\times|x|^{-\mu/3}\,|p_y|^{-1/3}\sqrt{\frac{\partial p_y}{\partial y}}. \tag{125}$$

(Cf. (98).) This asymptotic approximation is exact to leading order as $\mathbf{x} \to \mathbf{0}$ from within Region N.
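The statement that $q \propto |h'|^{-1/3}\sqrt{h''}$ solves the transport ODE $2[h' + |\lambda_y|z]q' + h''q = 0$ can also be spot-checked numerically. The sketch below works on the branch $h' > 0$ and uses the parametric solution $z = h'/2|\lambda_y| + c\,(h')^{1/3}$ of the scaling ODE for $h$, so that $h'' = 1/(dz/dh')$. The function names and the sample values of `lam_y` (standing for $|\lambda_y|$) and `c` are hypothetical.

```python
import math

def z_prime(q, lam_y, c):
    """dz/dq for z = q / (2 lam_y) + c q^(1/3), with q = h' > 0."""
    return 1.0 / (2.0 * lam_y) + (c / 3.0) * q ** (-2.0 / 3.0)

def prefactor_scaling(q, lam_y, c):
    """Candidate solution |h'|^(-1/3) sqrt(h''), written in terms of q = h'."""
    return q ** (-1.0 / 3.0) / math.sqrt(z_prime(q, lam_y, c))

def transport_residual(q, lam_y, c, dq=1e-6):
    """Residual of 2 [h' + lam_y z] q' + h'' q = 0, normalized by h'' q."""
    z = q / (2.0 * lam_y) + c * q ** (1.0 / 3.0)
    h2 = 1.0 / z_prime(q, lam_y, c)  # h'' = dq/dz
    d_dq = (prefactor_scaling(q + dq, lam_y, c)
            - prefactor_scaling(q - dq, lam_y, c)) / (2.0 * dq)
    d_dz = d_dq * h2                 # chain rule: d/dz = (dq/dz) d/dq
    value = prefactor_scaling(q, lam_y, c)
    return (2.0 * (q + lam_y * z) * d_dz + h2 * value) / (h2 * value)

# Illustrative parameter values (assumptions, chosen arbitrarily).
for q in (0.1, 1.0, 5.0):
    print(q, transport_residual(q, lam_y=1.3, c=0.7))
```

The normalized residual is at the level of finite-difference error for each sample point, as expected if the candidate is an exact solution.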

The formula (125) facilitates the computation of the transformed prefactor $\tilde K^{(y)} = \tilde K^{(y)}(x,p_y)$. It may be computed from $K$ as in (99), by dividing by the appropriate 'Van Vleck factor.' We immediately find

$$\tilde K^{(y)}(x,p_y) \propto K(x,y)\bigg/\sqrt{\frac{\partial^2 W}{\partial y^2}(x,y)} \tag{126}$$

$$\sim \text{const}\times|x|^{-\mu/3}\,|p_y|^{-1/3}, \tag{127}$$

since $\partial^2 W/\partial y^2 = \partial p_y/\partial y$. (Cf. (99)-(100).) The uncomplicated asymptotic approximation (127) should be accurate to $O(|x|^{-\mu})$ as $x \to 0$, when $p_y = O(|x|^{2\mu})$. It simply says that $\tilde K^{(y)}(x, a|x|^{2\mu}) \sim \text{const}\times|a|^{-1/3}|x|^{-\mu}$ as $x \to 0$, for any nonzero $a$.



The two remaining transformed prefactors, $\widetilde K$ and $\widetilde K^{(x)}$, may be
computed from $\widetilde K^{(y)}$ by dividing (or multiplying) by the appropriate Van Vleck
factors. (Cf. (66) and (68).) For example,

\[ \widetilde K(p_x,p_y) \propto \widetilde K^{(y)}(x,p_y) \Big/ \sqrt{\left| \frac{\partial^2 \widetilde W^{(y)}}{\partial x^2} \right|}. \tag{128} \]

The details are left to the reader. One finds

\begin{align}
\widetilde K(p_x,p_y)     &\sim \mathrm{const} \times |p_x|^{-\mu/3}\, |p_y|^{-1/3}, \tag{129} \\
\widetilde K^{(x)}(p_x,y) &\sim \mathrm{const} \times |p_x|^{-\mu/3}\, |y|^{-1/3}. \tag{130}
\end{align}

The transformed prefactor $\widetilde K^{(x)}$ is the one we need for the Maslov-WKB
diffraction integral. In Section 8.2, we examine the numerical evidence for the validity of
this asymptotic approximation to $\widetilde K^{(x)} = \widetilde K^{(x)}(p_x,y)$, when
$p_x \to 0$ at fixed nonzero $y$.

Remarkably, the formula (130) predicts that $\widetilde K^{(x)}(p_x,y)$ diverges at the
location $(p_x,y) = (0,0)$ of the nascent cusp. This is different from the case of a generic
(cusp) singularity, treated in Section 7. It is also different from the geometrical optics
limit of physical optics, where the transformed amplitude function near a singularity is
normally a 'slowly varying' (i.e., non-singular) function. The fact that the transformed
prefactor $\widetilde K^{(x)}$ diverges at the nascent cusp is at least as important to the
weak-noise behavior of critical double well models as the fact that the normal form for the
transformed action $\widetilde W^{(x)}$ is nonpolynomial.



8.2 Comparison with Numerics 

We now summarize the numerical evidence for the validity, in double well models at
criticality, of our nonpolynomial normal form for the Legendre-transformed action
$\widetilde W^{(x)} = \widetilde W^{(x)}(p_x,y)$, and our approximation to the transformed WKB
prefactor $\widetilde K^{(x)} = \widetilde K^{(x)}(p_x,y)$. We shall see that both are valid
approximations near the $y$-axis separatrix; in particular, near the saddle point. This
justifies their use in the Maslov-WKB method, which we shall employ in Section 8.3 to
construct boundary layer approximations to the stationary and quasistationary probability
distributions of double well models at criticality.

We begin by examining the evidence for the nonpolynomial normal form (118) for
$\widetilde W^{(x)}(p_x,y)$. Actually we shall study the related nonpolynomial approximation
(119) to its first derivative $x = x(p_x,y)$, i.e.,

\[ x = x(p_x,y) \approx -p_x/2\lambda_x + \tfrac{4\mu}{3}\, c_{p_x,y}\, y^{4/3} \bigl[ |p_x|^{(4\mu/3)-1} \operatorname{sgn} p_x \bigr]. \tag{131} \]

As explained above, we expect on theoretical grounds that this approximation is generically
valid near the saddle point, in critical models in which the quotient
$\mu = |\lambda_y|/\lambda_x$ satisfies $3/4 < \mu < 3$. The formula (131) predicts that at
nonzero $y$, the correspondence $p_x \mapsto x$ is monotone if $\mu > 3/2$, but non-monotone
if $\mu < 3/2$. When $\mu < 3/2$, the correspondence $p_x \mapsto x$ is analogous to the
correspondence $m \mapsto -h$, in a ferromagnet, between magnetization and (negative)
magnetic field.
It is easily checked that when

\[ |x| \lesssim \mathrm{const} \times |y|^{(3/2-\mu)^{-1}}, \qquad y \to 0, \tag{132} \]

the inverse map $x \mapsto p_x$ is three-valued rather than single-valued. In this region the
three possible values for $p_x$ are, by examination, of the same magnitude as $x$, i.e.,

\[ p_x = O\bigl(|y|^{(3/2-\mu)^{-1}}\bigr), \qquad y \to 0. \tag{133} \]

We interpret the inequality (132) as defining a two-sided nongeneric caustic centered on the
$y$-axis, in the interior of which the action $W$, and its gradient $\mathbf{p} = \nabla W$,
are three-valued.
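
The three-valuedness claimed in (132)-(133) can be checked numerically on the two-term form (131) itself. The following is a minimal sketch, not the computation used for the figures: the values `LAM_X = 1` and `C = 1` are purely illustrative assumptions (the text does not give $\lambda_x$ or $c_{p_x,y}$ numerically), the constant is taken positive, and the coefficient of the nonpolynomial term is written as $(4\mu/3)c_{p_x,y}$, which only rescales the constant. The helper names are hypothetical.

```python
import math

# Illustrative assumptions only: lambda_x and the constant c_{p_x,y} are not
# specified numerically in the text.
LAM_X = 1.0
C = 1.0

def x_of_p(p, y, mu):
    """x(p_x, y) from the two-term form (131), with the assumed constants above."""
    if p == 0.0:
        return 0.0
    a = 4.0 * mu / 3.0 - 1.0                      # exponent of the nonpolynomial term
    nonpoly = (4.0 * mu / 3.0) * C * abs(y) ** (4.0 / 3.0) * abs(p) ** a
    return -p / (2.0 * LAM_X) + math.copysign(nonpoly, p)

def turning_point(y, mu):
    """Positive p_x where dx/dp_x = 0; meaningful for 3/4 < mu < 3/2 and C > 0."""
    a = 4.0 * mu / 3.0 - 1.0
    coeff = 2.0 * LAM_X * a * (4.0 * mu / 3.0) * C * abs(y) ** (4.0 / 3.0)
    return coeff ** (1.0 / (1.0 - a))

def caustic_half_width(y, mu):
    """Largest |x| with three preimages: the value of x at the turning point."""
    return x_of_p(turning_point(y, mu), y, mu)

def bisect(f, lo, hi, iters=200):
    """Plain bisection on a bracketing interval [lo, hi]."""
    f_lo = f(lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def preimages(x0, y, mu):
    """All p_x with x(p_x, y) = x0, searched on the three monotone branches."""
    p_star = turning_point(y, mu)
    width = caustic_half_width(y, mu)
    big = 2.0 * p_star
    while x_of_p(big, y, mu) > -(abs(x0) + width):   # go out until the linear term wins
        big *= 2.0
    f = lambda p: x_of_p(p, y, mu) - x0
    roots = []
    for lo, hi in [(-big, -p_star), (-p_star, p_star), (p_star, big)]:
        if f(lo) * f(hi) < 0.0:
            roots.append(bisect(f, lo, hi))
    return roots
```

With `mu = 0.85` and `y = 0.05`, calling `preimages(0.5 * caustic_half_width(0.05, 0.85), 0.05, 0.85)` returns three values of $p_x$, while a target $x$ outside the caustic returns one. Under these assumptions `caustic_half_width(y, mu)` scales exactly as $|y|^{(3/2-\mu)^{-1}}$, which is the content of (132).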
Figure 11: The flow field of instanton trajectories emanating from both stable fixed points
$S$ and $S'$, in critical versions of the standard double well model. Parts (a), (b), (c), (d)
correspond to models with $\mu = 0.725$, 0.85, 1.15, and 1.6. In all cases the parameter
$\alpha$ is set equal to the critical value $\alpha_c = 2\mu(\mu+1)$, at which the MPEP
bifurcates. The two-sided nongeneric caustic seen in an earlier figure is visible in parts (b)
and (c), but it has separated into two generic caustics in part (d).
Recall that the Lagrangian manifold $\mathcal{M}$, which is traced out by WKB
bicharacteristics, comprises all points in phase space of the form
$(\mathbf{x}, \mathbf{p}(\mathbf{x}))$. The three-valuedness of $W$ and $\mathbf{p}$ within
the caustic accordingly implies that there are three points on $\mathcal{M}$ 'above' any point
$\mathbf{x}$ in the interior of the caustic. We have already seen in an earlier figure that a
nongeneric caustic qualitatively agreeing with this prediction does indeed appear in the
$\mu = 1$ standard double well model, at criticality.

Figures 11(a)-11(d) show the flow field of instanton trajectories, i.e., projected
bicharacteristics, in several more critical variants of the standard double well model. (At
criticality $\alpha = \alpha_c = 2\mu(\mu+1)$, by eq. (58).) Figures 11(b) and 11(c), with
$\mu = 0.85$ and $\mu = 1.15$, illustrate the fact that the two-sided caustic seen earlier
appears at criticality in any double well model whose parameter $\mu = |\lambda_y|/\lambda_x$
satisfies $3/4 < \mu < 3/2$. The caustic disappears, as expected, in critical models with
$\mu > 3/2$. Figure 11(d) shows what happens. As $\mu$ in the standard model is raised above
3/2 (with $\alpha$ set equal to $\alpha_c = \alpha_c(\mu)$), the two-sided caustic separates
into two one-sided generic caustics, whose cusps move out along the positive and negative
$y$-axes, away from the saddle. In any critical model with $\mu > 3/2$, there is a portion of
the separatrix near the saddle that is not crossed by any instanton trajectory.



Figure 11(a) illustrates the bizarre behavior that occurs in critical models with
$\mu < 3/4$. At first glance it seems that the now familiar two-sided caustic is present, but
closer study reveals that points in its interior are reached by only two instanton
trajectories, rather than three. Apparently, in the $\mu < 3/4$ regime the approximation (131)
breaks down near the separatrix. Empirically, when $\mu < 3/4$ the $|p_x|^{(4\mu/3)-1}$ factor
in (131) must be replaced by unity. The $\mu < 3/4$ regime is still under investigation, and
we shall not consider it further in this paper.

We can now compare the predictions of our scaling theory with numerical data. Figure 12(a)
is a section through the caustic of Fig. 11(b), i.e., a cross-section through the
corresponding Lagrangian manifold. It shows the correspondence $p_x \mapsto x$, at $y = 0.05$,
in the $\mu = 0.85$ standard model at criticality. The qualitative shape of the curve
certainly resembles the prediction of formula (131). But before making a quantitative
comparison, we need to discuss the interpretation of (131). It was derived from an asymptotic
development of the action about the saddle point. To what extent does it describe the
small-$|p_x|$ asymptotics of $x = x(p_x,y)$ at fixed, nonzero $y$? That is what is plotted in
Fig. 12.

From a rigorous point of view, when $\mu < 3/2$ the formula (131) provides a two-term
asymptotic expansion of $x = x(p_x,y)$ as $y \to 0$, on the
$p_x = O(|y|^{(3/2-\mu)^{-1}})$ lengthscale on which the nongeneric caustic is visible. This
is strongly reminiscent of the 'Region N' constraint of the last section. There we began by
approximating $W = W(x,y)$ in the notch-shaped region depicted in an earlier figure. Here we
are approximating $x = x(p_x,y)$, and by extension its anti-derivative
$\widetilde W^{(x)} = \widetilde W^{(x)}(p_x,y)$, in a region that is similarly notch-shaped,
but is centered on the $y$-axis rather than the $x$-axis. We shall not attempt to expand $x$
and $\widetilde W^{(x)}$ systematically, but the basic procedure is plain. If we define a new
scaling variable

\[ z = p_x / |y|^{(3/2-\mu)^{-1}}, \tag{134} \]

then formula (131) can be interpreted as comprising the first two terms in an asymptotic
development of $x(z|y|^{(3/2-\mu)^{-1}}, y)$ as $y \to 0$. The development should be valid at
any fixed $z$.

Figure 12: A section through the nongeneric caustic appearing at criticality in the
$\mu = 0.85$ standard double well model, as shown in Fig. 11(b). This section is taken at
$y = 0.05$. Parts (a) and (b) show a linear and a logarithmic plot of
$x = x(p_x, y = 0.05)$. The scaling behavior [...] at $p_x = 10^{-5}$.

Though this restatement is a bit pedantic, it suggests that on smaller lengthscales than
$p_x = O(|y|^{(3/2-\mu)^{-1}})$, the formula (131) might be invalid. Actually there are strong
reasons for believing that the nonanalytic $|p_x|^{(4\mu/3)-1} \operatorname{sgn} p_x$
behavior as $p_x \to 0$ does not appear at fixed, nonzero $y$. If it did,
$\partial x/\partial p_x\,(p_x = 0)$ would diverge at criticality, at $y \neq 0$, in any model
with $3/4 < \mu < 3/2$. Equivalently, $\partial p_x/\partial x\,(x = 0)$, i.e.,
$\partial^2 W/\partial x^2\,(x = 0)$, would be identically zero, irrespective of the choice of
nonzero $y$. But this prediction is too simple: it ignores the presence of 'sub-scaling'
terms. We noted in the last section that at criticality, $W = W(x,y)$ should contain an
$x^2 y^2$ term, for consistency with the sub-scaling $W_2(x) \sim \mathrm{const} \times x^2$
behavior. In other words, $\partial^2 W/\partial x^2\,(x = 0)$ near the saddle should be
nonzero and proportional to $y^2$. For consistency with this prediction, the nonanalytic
$|p_x|^{(4\mu/3)-1} \operatorname{sgn} p_x$ behavior of $x = x(p_x,y)$ must be 'rounded' at
sufficiently small $|p_x|$. It is easy to check that
$p_x = O(|y|^{(5/2)(3/2-\mu)^{-1}})$ is the correct lengthscale. On that lengthscale, one
should find $W(x,y) \sim \mathrm{const} \times x^2 y^2$, i.e.,
$p_x \sim \mathrm{const} \times x y^2$, or $x \sim \mathrm{const} \times y^{-2} p_x$.

What we conclude from this discussion is that at fixed, nonzero $y$, the nonpolynomial
formula (131) for $x = x(p_x,y)$ should be valid on the 'caustic lengthscale'
$p_x = O(|y|^{(3/2-\mu)^{-1}})$, but that it will break down when $p_x$ is decreased to
$O(|y|^{(5/2)(3/2-\mu)^{-1}})$. On that smaller lengthscale, one expects a crossover to a
linear regime, where $x$ is proportional to $p_x$. We can now proceed to our comparison with
numerics. In Fig. 12(b) we plot the correspondence $p_x \mapsto x(p_x, y = 0.05)$ of
Fig. 12(a) on a logarithmic scale. We also fit two trendlines to it:
$x \propto |p_x|^{(4\mu/3)-1}$ and $x \propto p_x$. As the two trendlines reveal, our
theoretical analysis is perfectly confirmed. There is indeed a crossover to a linear,
sub-scaling regime when $p_x$ is decreased to $O(|y|^{(5/2)(3/2-\mu)^{-1}})$. But at larger
lengthscales, e.g., $p_x = O(|y|^{(3/2-\mu)^{-1}})$, the fractional power
$|p_x|^{(4\mu/3)-1} \operatorname{sgn} p_x$ of the scaling formula (131) is clearly visible.
Similar crossover plots can be obtained for other critical double well models, whose
parameter $\mu = |\lambda_y|/\lambda_x$ lies in the range $3/4 < \mu < 3/2$.
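
To get a feel for the two lengthscales in the model of Fig. 12, one can evaluate them at $\mu = 0.85$, $y = 0.05$. The sketch below assumes unit prefactors in both $O(\cdot)$ estimates, which the text does not supply, so only the orders of magnitude are meaningful. The function names are hypothetical.

```python
def caustic_scale(y, mu):
    """|y|**(1/(3/2 - mu)): lengthscale on which the nongeneric caustic is visible."""
    return abs(y) ** (1.0 / (1.5 - mu))

def crossover_scale(y, mu):
    """|y|**((5/2)/(3/2 - mu)): lengthscale below which x becomes linear in p_x."""
    return abs(y) ** (2.5 / (1.5 - mu))

if __name__ == "__main__":
    mu, y = 0.85, 0.05
    print("caustic lengthscale  :", caustic_scale(y, mu))
    print("crossover lengthscale:", crossover_scale(y, mu))
```

With unit prefactors the caustic lengthscale comes out near $10^{-2}$ and the crossover lengthscale near $10^{-5}$, i.e., the two regimes are separated by roughly three decades in $p_x$ at this value of $y$.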

Our asymptotic approximation
$\widetilde K^{(x)}(p_x,y) \sim \mathrm{const} \times |p_x|^{-\mu/3} |y|^{-1/3}$ to the
transformed prefactor $\widetilde K^{(x)}$ at criticality, derived in Section 8.1, can also be
numerically tested. The approximation should be valid in the vicinity of the $y$-axis
separatrix, i.e., as $p_x \to 0$ at fixed, nonzero $y$. There are two separate cases:
$3/4 < \mu < 3/2$, when a caustic is present, and $\mu > 3/2$, when one is not. For simplicity
we consider only the latter. When $\mu > 3/2$, it follows from (131) that
$x(p_x,y) \sim -p_x/2\lambda_x$ as $p_x \to 0$; the nonpolynomial term is subdominant. So our
asymptotic approximation to $\widetilde K^{(x)}$ implies that (cf. (68))



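\begin{align}
K(x,y) &\propto \widetilde K^{(x)}(p_x,y) \sqrt{-\frac{\partial p_x}{\partial x}(p_x,y)} \tag{135} \\
       &= \widetilde K^{(x)}(p_x,y) \Big/ \sqrt{-\frac{\partial^2 \widetilde W^{(x)}}{\partial p_x^2}(p_x,y)} \tag{136} \\
       &\sim \mathrm{const} \times |x|^{-\mu/3}\, |y|^{-1/3}, \tag{137}
\end{align}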



as $x \to 0$ at fixed, nonzero $y$. This comparatively slow power-law divergence as the
separatrix is approached at (small) nonzero $y$ is to be contrasted with the
$K(x,0) \sim \mathrm{const} \times |x|^{-\mu}$ divergence that occurs when the saddle point is
approached along the $x$-axis. (See (62).) It is susceptible to numerical test.

In Fig. 13 we graph $K = K(x,y)$ as a function of $x$, at $y = 0.05$, for the critical
version of the standard double well model with $\mu = 1.6$. (This is the same model whose
instanton trajectories are shown in Fig. 11(d).) The curve is fitted to high accuracy by a
power law $\mathrm{const} \times x^{-0.533}$, i.e., $\mathrm{const} \times x^{-\mu/3}$. This
confirms the prediction of our scaling theory. No sub-scaling regime is evident at small
$|x|$. Similar plots can be obtained for the near-separatrix behavior of $K$ in critical
models with other values of $\mu$.

We conclude that the asymptotic approximations to
$\widetilde W^{(x)} = \widetilde W^{(x)}(p_x,y)$ and
$\widetilde K^{(x)} = \widetilde K^{(x)}(p_x,y)$ derived from our scaling theory have a wide
domain of validity, and may be employed in the Maslov-WKB method.



8.3 Scaling Beyond the WKB Approximation 

We can now compute the Maslov-WKB boundary layer approximations to the stationary
density $\rho_0$ and the quasistationary density $\rho_1$ near the nascent cusp at the saddle
point, where the conventional WKB approximation breaks down. The boundary layer
approximations are determined by (118) and (130), the asymptotic approximations to the
Legendre-transformed action $\widetilde W^{(x)}$ and the transformed prefactor
$\widetilde K^{(x)}$ respectively. As discussed above, these approximations should be valid
when the eigenvalue ratio $\mu = |\lambda_y|/\lambda_x$ satisfies $3/4 < \mu < 3$.
Figure 13: Behavior of the WKB prefactor near the $y$-axis separatrix, in a critical version
of the $\mu = 1.6$ standard double well model. $K = K(x, y = 0.05)$ is plotted on a
logarithmic scale, revealing the scaling behavior ($K \propto x^{-\mu/3}$ at nonzero $y$).



Substituting (118) and (130) into the one-dimensional diffraction integral (67) yields the
rather complicated expression

\begin{align}
K(x,y) \exp[-W(x,y)/\epsilon]
  &\sim \epsilon^{-1/2} \int \widetilde K^{(x)}(p_x,y)
        \exp\bigl\{ [-x p_x + \widetilde W^{(x)}(p_x,y)]/\epsilon \bigr\}\, dp_x \notag \\
  &\sim \mathrm{const} \times \epsilon^{-1/2}\, e^{-W(0,0)/\epsilon}\, |y|^{-1/3}\,
        e^{-|\lambda_y| y^2/\epsilon} \notag \\
  &\qquad \times \int |p_x|^{-\mu/3}
        \exp\Bigl\{ \Bigl[ -\frac{p_x^2}{4\lambda_x} + c_{p_x,y}\, y^{4/3} |p_x|^{4\mu/3}
        - x p_x \Bigr] \Big/ \epsilon \Bigr\}\, dp_x, \tag{138}
\end{align}

which requires a bit of explanation. The first problem to be resolved is the lengthscale
near $(x,y) = (0,0)$ on which this diffraction integral defines a valid Maslov-WKB
approximation. The $p_x^2$ and $x p_x$ terms in the argument of $\exp(\cdot)$ are $O(1)$ when
$p_x^2$ and $x p_x$ are $O(\epsilon)$; i.e., when $x$ and $p_x$ are $O(\epsilon^{1/2})$. The
term $c_{p_x,y}\, y^{4/3} |p_x|^{4\mu/3}/\epsilon$ is $O(1)$ when, also,
$y = O(\epsilon^{3/4-\mu/2})$. This is the case when $\mu < 3/2$, at least. If $\mu > 3/2$
then the $c_{p_x,y}\, y^{4/3} |p_x|^{4\mu/3}/\epsilon$ term is negligible whenever
$y = o(1)$. We conclude that in the weak-noise ($\epsilon \to 0$) limit of models with
$\mu < 3/2$, the diffraction integral (138) defines a valid Maslov-WKB approximation on the
$x = O(\epsilon^{1/2})$, $y = O(\epsilon^{3/4-\mu/2})$ lengthscale near the saddle point. This
is precisely the caustic lengthscale $x = O(|y|^{(3/2-\mu)^{-1}})$, i.e.,
$p_x = O(|y|^{(3/2-\mu)^{-1}})$, of the last section. If $\mu > 3/2$, so that no caustic is
present, then $y = O(\epsilon^{3/4-\mu/2})$ must be replaced by $y = o(1)$. On the
appropriate lengthscale, the diffraction integral defines a noncanonical diffraction function.
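
The balance that fixes the $y$ lengthscale for $\mu < 3/2$, and its agreement with the caustic lengthscale, can be spelled out as follows. This is a sketch of the power counting only; it assumes the nonpolynomial term enters the exponent as $c_{p_x,y}\, y^{4/3} |p_x|^{4\mu/3}/\epsilon$, with $c_{p_x,y}$ treated as an $O(1)$ constant.

```latex
\begin{align*}
\left. \frac{c_{p_x,y}\, y^{4/3} |p_x|^{4\mu/3}}{\epsilon} \right|_{p_x = O(\epsilon^{1/2})}
  &= O\bigl( y^{4/3}\, \epsilon^{2\mu/3 - 1} \bigr) = O(1)
  \quad\Longleftrightarrow\quad
  y = O\bigl( \epsilon^{(3/4)(1 - 2\mu/3)} \bigr) = O\bigl( \epsilon^{3/4 - \mu/2} \bigr), \\
|y|^{(3/2-\mu)^{-1}}
  &= O\bigl( \epsilon^{(3/4 - \mu/2)/(3/2 - \mu)} \bigr) = O\bigl( \epsilon^{1/2} \bigr).
\end{align*}
```

The second line shows why the two descriptions coincide: since $(3/4 - \mu/2)/(3/2 - \mu) = 1/2$, a point with $y = O(\epsilon^{3/4-\mu/2})$ lies on the caustic lengthscale exactly when $x$ and $p_x$ are $O(\epsilon^{1/2})$.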
The diffraction integral, being one-dimensional, cannot resolve the singularity at $y = 0$.
This is because it does not include an integration over $p_y$. So one cannot expect the
Maslov-WKB approximation to be valid at arbitrarily small $y$. The stationary and
quasistationary densities in critical models are expected to be tightly concentrated on the
$y = O(\epsilon^{1/2})$ transverse lengthscale near the saddle point, as we saw in (29)-(31)
(which apply in the absence of focusing). At most, the Maslov-WKB approximation will be valid
in the far field of the $y = O(\epsilon^{1/2})$ lengthscale, where the factor
$e^{-|\lambda_y| y^2/\epsilon}$ is exponentially small. So it will not be directly comparable
to (29)-(31). But it proves to be very useful nonetheless. Define 'stretched' variables
$X = x/\epsilon^{1/2}$ and $Y = y/\epsilon^{1/2}$; also, change the integration variable to
$P_x = p_x/\epsilon^{1/2}$. The approximation becomes

\[ \mathrm{const} \times \epsilon^{-(\mu+1)/6}\, e^{-W(0,0)/\epsilon}\, |Y|^{-1/3}\, e^{-|\lambda_y| Y^2} \int |P_x|^{-\mu/3}\, e^{-X P_x}\, e^{-P_x^2/4\lambda_x}\, dP_x, \tag{139} \]

since when $p_x, y = O(\epsilon^{1/2})$, the term
$c_{p_x,y}\, y^{4/3} |p_x|^{4\mu/3}/\epsilon$ is negligible.
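
The origin of the $\epsilon^{-(\mu+1)/6}$ prefactor in (139) is a matter of collecting powers of $\epsilon$ under the stretching $x = \epsilon^{1/2} X$, $y = \epsilon^{1/2} Y$, $p_x = \epsilon^{1/2} P_x$. A sketch of the count, taking the prefactor $\epsilon^{-1/2} |y|^{-1/3} |p_x|^{-\mu/3}$ from (138) as given:

```latex
\begin{align*}
\epsilon^{-1/2}\, |y|^{-1/3}\, |p_x|^{-\mu/3}\, dp_x
  &= \epsilon^{-1/2}\, \bigl( \epsilon^{1/2} |Y| \bigr)^{-1/3}
     \bigl( \epsilon^{1/2} |P_x| \bigr)^{-\mu/3}\, \epsilon^{1/2}\, dP_x \\
  &= \epsilon^{-1/2 - 1/6 - \mu/6 + 1/2}\, |Y|^{-1/3}\, |P_x|^{-\mu/3}\, dP_x \\
  &= \epsilon^{-(\mu+1)/6}\, |Y|^{-1/3}\, |P_x|^{-\mu/3}\, dP_x, \\[4pt]
\frac{|\lambda_y| y^2}{\epsilon} = |\lambda_y| Y^2, \qquad
\frac{x p_x}{\epsilon} &= X P_x, \qquad
\frac{p_x^2}{4 \lambda_x \epsilon} = \frac{P_x^2}{4 \lambda_x}.
\end{align*}
```

Of the total exponent, $1/6$ comes from the $|y|^{-1/3}$ factor and $\mu/6$ from the $|p_x|^{-\mu/3}$ singularity of the transformed prefactor; the $\epsilon^{-1/2}$ of the diffraction integral is cancelled by the Jacobian $dp_x = \epsilon^{1/2}\, dP_x$.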

At this point we must explain how to interpret the integration over $P_x$, or $p_x$. The
stationary density $\rho_0(x,y) = \rho_0(X\epsilon^{1/2}, Y\epsilon^{1/2})$ must be even in
$X$, and the quasistationary density $\rho_1(x,y) = \rho_1(X\epsilon^{1/2}, Y\epsilon^{1/2})$
must be odd. To get approximations with these symmetry properties, we may integrate $P_x$
from $-\infty$ to $\infty$ and from $0$ to $\infty$ respectively. We remarked at the end of an
earlier section that performing a half-range integration is one way of incorporating an
antisymmetry constraint, and that is the technique we shall use.

As summarized in Abramowitz and Stegun (§19.5), definite integrals resembling (139) define
parabolic cylinder functions. Evaluating the integral (139), with the two possible choices of
the range of integration (full range and half range), yields the Maslov-WKB approximations

\begin{align}
\rho_0(x,y) &\sim \mathrm{const} \times \epsilon^{-(\mu+1)/6}\,
   F_0(\lambda_x^{1/2} x/\epsilon^{1/2})\, e^{+\lambda_x x^2/\epsilon}\,
   |y/\epsilon^{1/2}|^{-1/3}\, e^{-|\lambda_y| y^2/\epsilon}, \tag{140} \\
\rho_1(x,y) &\sim \mathrm{const} \times \epsilon^{-(\mu+1)/6}\,
   F_1(\lambda_x^{1/2} x/\epsilon^{1/2})\, e^{+\lambda_x x^2/\epsilon}\,
   |y/\epsilon^{1/2}|^{-1/3}\, e^{-|\lambda_y| y^2/\epsilon} \tag{141}
\end{align}

to the stationary and quasistationary probability densities, on the $O(\epsilon^{1/2})$
lengthscale near the saddle point. Here the so-called boundary layer functions
$F_i = F_i(Z)$, where $Z = \lambda_x^{1/2} X = \lambda_x^{1/2} x/\epsilon^{1/2}$, are defined
by

\[ F_i(Z) = y_{i+1}(1/2 - \mu/3,\; 2^{1/2} Z)\, e^{-Z^2/2}, \tag{142} \]

in the notation of Abramowitz and Stegun. $y_1(1/2 - \mu/3, \cdot)$ and
$y_2(1/2 - \mu/3, \cdot)$ are even and odd parabolic cylinder functions, respectively. We
could equally well define the boundary layer functions $F_i$ in terms of an Hermite function
of non-integer index, by

\[ F_0(Z) = \bigl[ H_{(\mu/3)-1}(Z)\, e^{-Z^2} \bigr]_{\mathrm{even}}, \qquad F_1(Z) = \bigl[ H_{(\mu/3)-1}(Z)\, e^{-Z^2} \bigr]_{\mathrm{odd}}. \tag{143} \]

Here $[\cdot]_{\mathrm{even}}$ and $[\cdot]_{\mathrm{odd}}$ signify even and odd parts, under
the reflection $Z \mapsto -Z$. The definitions (143) are meaningful whenever the index
$n = (\mu/3) - 1$ is not an integer, in which case the Hermite function $H_n(Z)$ is not a
conventional Hermite polynomial, and is neither even nor odd. But since we are assuming
$3/4 < \mu < 3$, this is always the case. Irrespective of the choice of definitions,

\begin{align}
F_0(Z) &\sim \mathrm{const} \times |Z|^{-\mu/3}, \notag \\
F_1(Z) &\sim \mathrm{const} \times |Z|^{-\mu/3} \operatorname{sgn} Z, \tag{144}
\end{align}

as $Z \to \pm\infty$. (Cf. Abramowitz and Stegun, §19.8.)
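
The power-law tail (144) can be checked directly on the integral in (139), without any special-function library. The sketch below is an independent numerical illustration, not the authors' procedure. It assumes $\lambda_x = 1$ and $\mu = 1.6$ for the tests, multiplies the integral by $e^{-\lambda_x X^2}$ (so that the result is proportional to $F_0$ under the reading of (140) used here), and removes the integrable $|P_x|^{-\mu/3}$ singularity with the substitution $P = s^m$, $m = 1/(1 - \mu/3)$. The function names are hypothetical.

```python
import math

def scaled_half_range(X, mu, lam_x=1.0, n=20000):
    """exp(-lam_x X^2) * integral_0^inf P^(-mu/3) exp(-X P - P^2/(4 lam_x)) dP.

    Completing the square turns the integrand into
    P^(-mu/3) * exp(-(P + 2 lam_x X)^2 / (4 lam_x)); the substitution P = s**m
    with m = 1/(1 - mu/3) absorbs the P^(-mu/3) factor into the Jacobian.
    """
    b = mu / 3.0
    m = 1.0 / (1.0 - b)
    sigma = math.sqrt(2.0 * lam_x)                       # width of the Gaussian factor
    p_max = max(0.0, -2.0 * lam_x * X) + 12.0 * sigma    # integrand is negligible beyond
    s_max = p_max ** (1.0 / m)
    h = s_max / n
    total = 0.0
    for k in range(n + 1):                               # composite trapezoid rule in s
        P = (k * h) ** m
        f = math.exp(-((P + 2.0 * lam_x * X) ** 2) / (4.0 * lam_x))
        total += f if 0 < k < n else 0.5 * f
    return m * h * total

def scaled_full_range(X, mu, lam_x=1.0):
    """Same integral over the full range of P_x; even in X by construction."""
    return scaled_half_range(X, mu, lam_x) + scaled_half_range(-X, mu, lam_x)

def tail_exponent(mu, X1=6.0, X2=12.0, lam_x=1.0):
    """Log-log slope of the full-range result between X1 and X2."""
    ratio = scaled_full_range(X2, mu, lam_x) / scaled_full_range(X1, mu, lam_x)
    return math.log(ratio) / math.log(X2 / X1)
```

For `mu = 1.6`, `tail_exponent(1.6)` should come out close to $-\mu/3 \approx -0.533$, the same exponent as the power-law fit to the prefactor quoted for Fig. 13. The residual difference reflects the finite values of $X$ used rather than a failure of (144).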

We stress that the Maslov-WKB approximations to $\rho_0$ and $\rho_1$ are strictly valid
only in the transverse far field, i.e., as $Y = y/\epsilon^{1/2} \to \pm\infty$. But they make
very clear how critical double well models differ from non-critical double well models. By
comparing (140)-(141) with (29)-(31), we see that at criticality, the boundary layer functions
$F_0(\cdot)$ and $F_1(\cdot)$ replace the boundary layer functions $1$ and
$\operatorname{erf}(\cdot)$ respectively. The approximations (140)-(141) are guaranteed to
match to the standard WKB approximation $K(\mathbf{x}) \exp[-W(\mathbf{x})/\epsilon]$, as one
moves in a transverse direction away from the saddle point. For example, the
$|Z|^{-\mu/3}$ falloff of eqs. (144) will match to the $|x|^{-\mu/3}$ prefactor falloff of
eqs. (125) and (127), which is seen, e.g., in Figure 13.

The Maslov-WKB approximations to $\rho_0$ and $\rho_1$, and the nonpolynomial normal form
for the Legendre-transformed action that engendered them, have several striking consequences
for double well models at criticality.

• A nongeneric caustic, emerging sideways from the nascent cusp at the saddle. In
  Section 8.1 we predicted from the nonpolynomial normal form for $\widetilde W^{(x)}$ that
  when $3/4 < \mu < 3/2$, a caustic is located at
  $|x| \sim \mathrm{const} \times |y|^{(3/2-\mu)^{-1}}$. Our prediction was confirmed by
  Figure 11. This caustic has an unusual (continuously varying) exponent. It is nongeneric,
  in the sense of singularity theory.

• An unusual (continuously varying) singularity index. As $\epsilon \to 0$, the falloff of
  the stationary density $\rho_0$ at the saddle point $(0,0)$ is not pure exponential, on
  account of the $\epsilon^{-(\mu+1)/6}$ prefactor in the Maslov-WKB approximation (140). This
  is interpreted as a statement that the nascent cusp has singularity index
  $s = s(\mu) = (\mu+1)/6$, as mentioned in an earlier section. It too is nongeneric, in the
  sense of singularity theory.

• Non-Arrhenius MFPT asymptotics. If one computes the rate at which the quasistationary
  density $\rho_1$ is absorbed on the separatrix near the saddle, the
  $\epsilon^{-(\mu+1)/6}$ prefactor in the Maslov-WKB approximation (141) will appear in the
  $\epsilon \to 0$ asymptotics. Equivalently, the exponentially decaying eigenvalue
  $\lambda_1 = \lambda_1(\epsilon)$ of the Smoluchowski operator will have an asymptotic
  $\epsilon^{-(\mu+1)/6}$ prefactor, as well as the usual Arrhenius factor [i.e.,
  $\exp(-\Delta W/\epsilon)$]. And the MFPT will be asymptotic to
  $\mathrm{const} \times \epsilon^{+(\mu+1)/6} \exp(+\Delta W/\epsilon)$, as
  $\epsilon \to 0$. At criticality, the weak-noise growth of the MFPT is slower than pure
  exponential.

• A non-Gaussian limiting exit location distribution. In the absence of MPEP bifurcation,
  for a symmetric double well model the location of the point of exit from either of the two
  wells would have an asymptotic Gaussian distribution, on the transverse
  $O(\epsilon^{1/2})$ lengthscale near the saddle. In fact, its density would fall off as
  $e^{-|\lambda_y| y^2/\epsilon}$. We see from the Maslov-WKB approximation to $\rho_1$ that
  at criticality, the exit location density on the separatrix includes scaling corrections.
  In the transverse far field it falls off as $|y|^{-1/3} e^{-|\lambda_y| y^2/\epsilon}$,
  rather than $e^{-|\lambda_y| y^2/\epsilon}$. It has a non-Gaussian tail.

These phenomena look natural from the point of view of the theory of critical phenomena, 
though the stochastic escape problem has not previously been considered from that point of 
view. 

9 Discussion 

We can now step back and review our results. We began with a WKB treatment of the 
weak-noise asymptotics of stationary (and quasistationary) solutions of the Smoluchowski 
equation. The WKB analysis led to instanton trajectories, which have a physical interpreta- 
tion as most probable weak-noise fluctuational paths. The instanton trajectories turned out 
to be zero-energy trajectories of an associated Hamiltonian dynamical system. This is be- 
cause the phase space versions of the instanton trajectories (i.e., WKB bicharacteristics) 
trace out a Lagrangian manifold in phase space. In double well models the onset of bifurca- 
tion is associated with the fleeting appearance of an unusual singularity (a nascent cusp) in 
the shape of this manifold, as the parameters of the model are varied. 

There is a formal analogy between the Lagrangian manifold of a dynamical system per- 
turbed by weak noise, and the thermodynamic surface of a condensed matter system. This 
analogy led us to construct a scaling theory of the shape of the Lagrangian manifold near 
the nascent cusp. To date, most work on Lagrangian manifolds has assumed that they are 
smooth, and that any apparent singularities in their shape can be transformed away by 
a change of coordinates. This is analogous to assuming that thermodynamic surfaces are 
real analytic, and that non-analyticities in thermodynamic behavior (i.e., phase transitions) 
can be transformed away by working in terms of the appropriate thermodynamic potential. 
Equivalently, it is analogous to assuming that all phase transitions have classical critical 
exponents. Our scaling theory makes it clear that the nascent cusp singularity is a gen- 
uine point of non-smoothness of the Lagrangian manifold. In thermodynamic terms, it has 
nonclassical, indeed continuously varying, critical exponents. 

Applying the Maslov-WKB method to the nascent cusp yielded several interesting
predictions, which we summed up in the four bulleted items at the end of the last section.
One normally expects that in a double well system perturbed by weak noise of strength
$\epsilon$, the rate of inter-well hopping $\lambda_1 = \lambda_1(\epsilon)$ will be
asymptotic to a constant multiple of the Arrhenius factor $\exp(-\Delta W/\epsilon)$, where
$\Delta W$ is an effective barrier height. Also, one expects that the distribution of exit
locations (from either well) will asymptotically become a Gaussian of $O(\epsilon^{1/2})$
standard deviation, centered on the saddle point between the two wells. The Maslov-WKB
method predicts that at criticality, both these phenomena are strongly altered. In
particular, the factor $\exp(-\Delta W/\epsilon)$ must be replaced by
$\epsilon^{-s} \exp(-\Delta W/\epsilon)$, where $s = (\mu+1)/6$ is the singularity index of
the nascent cusp. (As we noted in Section 8.3, the singularity index is a sort of critical
exponent.) In Fig. 14 we sketch an Arrhenius plot, showing this anomalous (non-Arrhenius)
behavior.

Figure 14: A sketch, on a logarithmic scale, of the rate of noise-activated inter-well
hopping $\lambda_1$ as a function of the reciprocal noise strength $1/\epsilon$. Off
criticality (solid curve), $\lambda_1$ displays a pure exponential falloff as
$\epsilon \to 0$, i.e., $\lambda_1 \sim \mathrm{const} \times \exp(-\Delta W/\epsilon)$. At
the onset of bifurcation (dashed curve),
$\lambda_1 \sim \mathrm{const} \times \epsilon^{-s} \exp(-\Delta W/\epsilon)$ instead, where
$s$ is the singularity index of the nascent cusp appearing at the saddle point.
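
The difference between the two curves of Fig. 14 is easiest to see in the local slope of the Arrhenius plot. For $\lambda_1 = \mathrm{const} \times \epsilon^{-s} \exp(-\Delta W/\epsilon)$ one has $d \ln \lambda_1 / d(1/\epsilon) = -\Delta W + s\epsilon$, so the slope approaches $-\Delta W$ only as $\epsilon \to 0$. The following minimal sketch evaluates that slope by finite differences. The barrier height `delta_w = 1` and the unit constant are illustrative assumptions, and the function names are hypothetical.

```python
import math

def log_rate(inv_eps, delta_w, s=0.0, log_const=0.0):
    """ln(lambda_1) for lambda_1 = const * eps**(-s) * exp(-delta_w / eps)."""
    eps = 1.0 / inv_eps
    return log_const - s * math.log(eps) - delta_w * inv_eps

def arrhenius_slope(inv_eps, delta_w, s=0.0, h=1e-4):
    """Central-difference estimate of d ln(lambda_1) / d(1/eps)."""
    upper = log_rate(inv_eps + h, delta_w, s)
    lower = log_rate(inv_eps - h, delta_w, s)
    return (upper - lower) / (2.0 * h)

if __name__ == "__main__":
    mu = 1.0
    s = (mu + 1.0) / 6.0          # singularity index of the nascent cusp
    for inv_eps in (5.0, 20.0, 80.0):
        off = arrhenius_slope(inv_eps, delta_w=1.0)            # off criticality
        crit = arrhenius_slope(inv_eps, delta_w=1.0, s=s)      # at criticality
        print(inv_eps, off, crit)
```

Off criticality the slope equals $-\Delta W$ at every noise strength, while at criticality it is shifted by $s\epsilon$, which is what bends the dashed curve of Fig. 14 away from a straight line.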

In mathematical terms, the nascent cusp appearing at the onset of bifurcation is a
nongeneric singularity, i.e., a singularity different from any of the now classical
singularities of catastrophe theory. As shown in an earlier figure, in many double well
models it induces an unusual caustic in the flow field of instanton trajectories. This
caustic is itself nongeneric, in that its exponent is not equal to 3/2. As we have seen (see,
e.g., Fig. 12), its presence quantitatively confirms the validity of our scaling theory. It
is remarkable that such nongeneric phenomena are a generic feature of singly parametrized
symmetric double well models.

At least as developed in this paper, our scaling theory is a scaling theory of weak-noise
behavior near the nascent cusp, precisely at criticality. It would be useful to treat as well
models that are nearly critical, but not exactly so. Such models should display a crossover
from non-Arrhenius behavior to Arrhenius behavior at sufficiently weak noise strength. By
developing a joint scaling theory, one of the variables in which measures the distance from
criticality, it should be possible to analyse this phenomenon. We expect that it is possible
to derive a 'Ginzburg criterion' [27], expressing how close to criticality any given double
well model should be, for the non-Arrhenius behavior of Fig. 14 to be visible. Work on this
is under way.

We briefly mention two geometric features of models 'off criticality' that cry out for a
theoretical explanation. A nongeneric caustic appears near the separatrix not only at
criticality, but in many non-critical models as well [12]. Also, as criticality is approached
(e.g., as $\alpha \to \alpha_c^-$ in the standard model), it frequently happens that the
nascent cusp is formed by a collision of two generic cusps, which move along the separatrix
toward the saddle point. These phenomena can presumably be explained by an appropriate joint
unfolding, but that is for the future.

We close by mentioning a possible extension of a more theoretical sort. In this paper we
have focused exclusively on the asymptotic solutions of the time-independent weak-noise
Smoluchowski equation. There is reason to believe that nongeneric singularities resembling
the nascent cusp can occur, and are perhaps even widespread, in the asymptotic solutions of
other singularly perturbed elliptic partial differential equations. Most WKB treatments of
singularly perturbed elliptic PDE's (see, e.g., Duistermaat [10]) assume that each WKB
characteristic (i.e., instanton trajectory) eventually leaves any bounded region of space.
This assumption is violated in the Smoluchowski equation for any double well model, since the
MPEP(s) terminate on the saddle point, rather than extending to infinity. We expect that when
it is violated in other PDE's, analogous nongeneric singularities in formal asymptotic
solutions can occur. The nongeneric singular phenomena that we have seen in this paper may
simply be representatives of a larger class.